Skip to Content
Part-time

Intermediate Big Data

Intermediate Big Data

Upcoming cohorts

According to recent surveys, there are approximately 200 millions active websites in the internet. That is why there is abundance of available data, what most are lacking instead, are the means to extract, process and analyze these data. In this course, we cover three crucial topics for any data task: Data extraction, Data storage and Data processing. Both JavaScript and Python libraries are utilized in the course to tackle different problems in the entire data processing pipeline.

This course is ideal for students who are seeking next level of understanding after the introductory level of programming.

Students attending this course will learn the following:

  • Scraping and extracting data from public webpage using powerful Web-scraping tools
  • Storing and consolidating data in cloud NoSQL database
  • Learning fundamental Python knowledge to work with data
  • Utilizing basic Python Data Science libraries to analyze and visualize unstructed data

Curriculum

Web Scraping

Web Scraping

Web Scraping is an essential component of modern data science field. It is well-known that most of the useful data are only reachable in the form of a webpage but not a well-defined data API. In this section , we will guide students to use Puppeteer, the powerful web-scraping tools, to extract information from popular websites including even Single page application. Students will also learn how to work with the most popular programming environment Node.JS.

It cover in-depth knowledge of the following:

  • Node Environment
  • Node Packages
  • Puppeteer
  • Case Studies with real examples
JavaScript
Puppeteer
NoSQL Database

NoSQL Database

Firebase is a NoSQL document-oriented cloud database that allows coders to store the data in a scalable and stable platform with minimal configuration. It is known for its ease-of-use in particular in the fields of unstructured data.

It covers in-depth knowledge of the following:

  • Firebase
  • Accessing firebase with Node
  • Storing scraped data to Firebase
Firebase
Basic Python

Basic Python

Python is the most popular programming language for working with data science. In this section, students are going to learn how to setup the environment and development tools to start their journey in Python and Data Science. Students are also going to learn the basic and advanced python such that they are able to work with the libraries in Python afterwards.

It covers in-depth knowledge of the following:

  • Python environment setup
  • Python Development tools
  • Basic and Advanced Python
Python
Basic Data Science

Basic Data Science

There are lots of data science libraries to make working with data an easier job. With the help of these libraries, students can easily extract, cleanse and visualise the data stored in the cloud databases. Students will learn how to post-process, analyze and present the data utilizing the libraries in this section.

This course covers the in-depth knowledge of the following:

  • Numpy - Matrix and Tensor manipulation tools
  • Pandas - Multi-format data processing tools
  • Seaborn - Statistical visualisation tools
  • Matplotlib - 2D plotting graph
Matplotlib
NumPy

Instructor team

Alex Lau

Alex Lau

Lead Instructor

Full Stack
Mobile App

Alex is recognized by a series of IT Awards (ICT Grand Award, HSBC Youth Business Award, etc) for his professionalism and technology innovation. Being the former CTO of Play More Limited and the Technology Advisor of several companies, he has vast experience in the software development, project management and technology solution fields. With over 15 years of programming experience, Alex is proficient in over 10 programming languages (C, C#, JS, Python, React ...). He is also passionate in learning, teaching and programming. In order to survive in the everchanging and competitive digital age, Alex had always adapting and learning up-to-date teaching approaches for tech learners. Over the years, he had successfully trained 300+ students to become software developers and oversaw several corporate training programs such as Swire Coca Cola and HKTDC. He has a vision to raise the standard and competitiveness of the entire IT coaching industry.

Gordon Lau

Gordon Lau

Curriculum Director

Full Stack
Data Science

Gordon's experience includes multiple roles of software development and leadership for different companies across various industries. With over 10 years' experience in professional programming and 4 years' experience in technology education, Gordon has mentored over 300+ newcomers to break into the technology sector. He was also the developer of the chatroom application HKGChat which acquired over 3,000 users on its launch day. He is additionally the principal developer of Tecky Code, which is Hong Kong's first open to public programming learning platform. As a firm believer in developing the future of Hong Kong’s IT industry, he has been promoting the importance of programming in mainstream education. Gordon is also an avid foreign language, science and travel enthusiast.

  • Next start date
  • Time 19:00 - 21:30 Every Tuesday and Thursday
  • Duration 6 weeks
  • Class size
  • Location The Wave Mongkok
Intermediate Big Data
Request Syllabus
Please check your email after submissions.
hello@tecky.iot.me/TeckyAcademy
+852 9725 6400+44 07570 549084+1 (647) 556-3084
Caring Company 2019-2022
TQUK Approved Centre
aws_partner
Reimagine Education Challenge Award
© 2022 Tecky Academy Limited