Python Tools for a Beginner Data Scientist

Python has tools for all stages of the life cycle of a data science project. Any data science project has the following 3 stages inherently included in it. Data Collection, Data Modelling, Data Visualization, Python provides very neat tools for all 3 of these stages. Data Collection, 1) Beautiful Soup

When data collection involves scraping data off of the web, python provides a library called beautifulsoup. This library parses a web page and stores its contents neatly. For example, it will store the title separately. It will also store all the <a> tags separately which will provide you with very neat list of URLs contained within the page. As an example let us look at a simple web page for the story of Alice’s Adventure in Wonderland.

Author: Rishi Sidhu

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s