This is not an extensive theory/practice course trying to touch each and every aspect of a concept: web scraping with Scrapy.
It is a dedicated course to help you gain a practical skill: how to write Scrapy Middleware to solve common web scraping problems on your own.
It achieves this in a complete manner. So it includes theory first, followed by application through case studies.
Hi!
Web Scraping has become an indispensible step of data science for developers who don't want to to replicate but create.
Like in many fields within coding, it is usually not too hard to learn and understand initial concepts.
And successfully complete examples within those popular courses.
..."Yes, you got that right, too, there you go!", "congrats, now proceed to the next concept..."
But when it comes to solving indigenous problems.
When it comes to creating on your own.
You feel that the simple theory/practice methodology does not do the job.
Yes you have that perfect request line, and you efficiently pipelined parsed items to the correct folder/database.
The first pages are retrieved flowlessly, but then...
But then...what happened?
You start getting 503, and maybe anything but the desired 200.
Yes you are banned!
Everything you have learned becomes useless at that moment.
Of course, It is not a hopeless situtation.
There are few ways to handle this.
You may stackoverflow!
They will ask your code, and than you will do what they say,
Sometimes it will work...
Here is the thing,
Whatif I tell you, although you might not be 'pro' in web scraping,
In few hours you can learn to write your own middleware to tackle difficult web scraping problems.
Those problems that you will for sure encounter,
Maybe not in the first, but definitly in your second web scraping attempt.
Yes, in 3 hours, I will show you how you can intutively create problem solver middlewares in Scrapy.
This will require deep knowledge of Scrapy Architecture.
A knowledge of flow and interactions of 4 main entities within Scrapy.
The engine, the scheduler, the middlewares and of course the spider object.
So this course has 2 main parts.
'Scrapy Architecture Deep Dive' and 'Creating Middleware'.
Both parts have two main sections. They start with corresponding theory section followed by a Case Study section to apply the theory.
Yes the course is specific, but the capability you gain will be general.
With this course, you will have a reach to the most intuitive explanation of Scrapy Architecture and how to create a problem-solver middleware in Scrapy, not excluding 2.x versions of this framework.
See you in the lessons.
Tarkan Aguner