Python has tools for all stages of the life cycle of a data science project. Any data science project has the following 3 stages inherently included in it. Data Collection, Data Modelling, Data Visualization, Python provides very neat tools for all 3 of these stages. Data Collection, 1) Beautiful Soup
When data collection involves scraping data off of the web, python provides a library called beautifulsoup. This library parses a web page and stores its contents neatly. For example, it will store the title separately. It will also store all the <a> tags separately which will provide you with very neat list of URLs contained within the page. As an example let us look at a simple web page for the story of Alice’s Adventure in Wonderland.
Author: Rishi Sidhu