Intermediate Big Data
Curriculum
Wander from the Source to the Sea of Data
Web Scraping
Web Scraping is an essential component of modern data science field. It is well-known that most of the useful data are only reachable in the form of a webpage but not a well-defined data API. In this section , we will guide students to use Puppeteer, the powerful web-scraping tools, to extract information from popular websites including even Single page application. Students will also learn how to work with the most popular programming environment Node.JS.
It cover in-depth knowledge of the following:
- Node Environment
- Node Packages
- Puppeteer
- Case Studies with real examples
NoSQL Database
Firebase is a NoSQL document-oriented cloud database that allows coders to store the data in a scalable and stable platform with minimal configuration. It is known for its ease-of-use in particular in the fields of unstructured data.
It covers in-depth knowledge of the following:
- Firebase
- Accessing firebase with Node
- Storing scraped data to Firebase
Basic Python
Python is the most popular programming language for working with data science. In this section, students are going to learn how to setup the environment and development tools to start their journey in Python and Data Science. Students are also going to learn the basic and advanced python such that they are able to work with the libraries in Python afterwards.
It covers in-depth knowledge of the following:
- Python environment setup
- Python Development tools
- Basic and Advanced Python
Basic Data Science
There are lots of data science libraries to make working with data an easier job. With the help of these libraries, students can easily extract, cleanse and visualise the data stored in the cloud databases. Students will learn how to post-process, analyze and present the data utilizing the libraries in this section.
This course covers the in-depth knowledge of the following:
- Numpy - Matrix and Tensor manipulation tools
- Pandas - Multi-format data processing tools
- Seaborn - Statistical visualisation tools
- Matplotlib - 2D plotting graph