NICF- Big Data Engineering for Analytics

NUS Institute of Systems Science

How long?

  • 5 days
  • in person

What are the topics?

NUS Institute of Systems Science


Coursalytics is an independent platform to find, compare, and book executive courses. Coursalytics is not endorsed by, sponsored by, or otherwise affiliated with any business school or university.

Full disclaimer.

Read more about Business Analytics

Business Analytics courses will introduce you to a popular and diverse profession. A business analyst is a specialist in many IT fields as well as in ...


Comprehensive course analysis

Unbiased reviews from past participants
Global companies alumni of this course worked for
Positions of participants who took this course
Countries where most past participants are from
Individual needs analysis

Who should attend

This is an intermediate course, suitable for professionals with some experience in any programming language and data design. If the participants have some business exposure, they can appreciate the case studies discussed better.

This course targets analytics professional including:

  • Business and IT professionals seeking analytical skills to handle large amounts of unstructured data (Data lake e.g. customer feedbacks, product reviews on social media, phone call recordings, etc.) for insights to improve business process and decision-making.
  • Individuals who have no knowledge or experience in data engineering for analytics and would like to gain some practical skills in this area so that they may explore work opportunities in data engineering.
  • Data analysts and Data Engineers, who want to move from the structured to large amounts of unstructured data engineering.


This is an intensive, intermediate course. Our proposed course targets the higher value chain professionals such as data engineers, data application architects, integration architects, software engineers working on data pipeline processing and key technology decision makers.

Participants with experience in programming languages such as Python or Java or Scala will benefit more from the course. Participants also need to have a strong interest in building functional pipelines and be comfortable working with Hadoop platform and Spark framework.

NUS-ISS also offers a range of other basic courses in analytics for participants new to analytics

About the course

This 5-day course helps data engineers focus on essential design and architecture while building a data lake and relevant processing platform.

Participants will learn various aspects of data engineering while building resilient distributed datasets. Participants will learn to apply key practices, identify multiple data sources appraised against their business value, design the right storage, and implement proper access model(s). Finally, participants will build a scalable data pipeline solution composed of pluggable component architecture, based on the combination of requirements in a vendor/technology agnostic manner. Participants will familiarize themselves on working with Spark platform along with additional focus on query and streaming libraries.

This course is part of the Analytics and Intelligent Systems series offered by NUS-ISS.

Key Takeaways

Upon effective completion of the course, participants will be able to:

  • Understand the growth of big data and need for a scalable processing framework. Understand the fundamental characteristics, storage, analysis techniques and the relevant distributions
  • Understand the distributed storage essentials, storage needs, and relevant architectural mechanism in processing large amounts of structured, semi-structured and unstructured data.
  • Gain expertise with the fault-tolerant computing framework (E.g. YARN) by setting up pseudo cluster nodes or cloud based nodes for processing big data. .
  • Construct configurable and executable tasks using the In Memory Processing frameworks (E.g. Spark Core). Understand the nuances of writing functional programs and use the core libraries to manipulate the large corpse of unstructured data residing as Resilient Distributed Datasets.
  • Organize, store and manipulate the collected data using processing libraries. For example, using special statistical operation and stream processing data tools (E.g. Spark Special Libraries).
  • Understand various data processing, querying and persistence (E.g. Spark QL APIs) available for usage in RDD’s context. Perform tasks such as filtering, selection and categorization.

What Will Be Covered

The course objective is to explore the engineering aspects of big data storage, querying and processing techniques. The course aims to teach the students to apply the newly acquired proficiencies by developing data intensive applications using distributed compute platform (e.g. using the Hadoop platform, Spark Framework and relevant tools).

A brief module description is provided below:


Module 1: Introduction to Data Science, Data Engineering and Big Data

Module 2: Understand Big Data from an Analytics Perspective

Module 3: Architectural Viewpoints in Big Data

Module 4: The Hadoop Ecosystem for Big Data

Module 5: Distributed File Storage

Module 6: NoSQL Databases for Big Data

Module 7: Spark and Functional Programming for Big Data

Module 8: Spark and Resilient Distributed Data Sets

Module 9: Spark QL for Big Data

Module 10: Spark and Real Time Stream Processing

Module 11: Management of Big Data initiatives

Discussion and Project Requirement Elaboration

Project and Assessment

  • Project Demonstration, Report Submission and Presentations. Each team will work on a practical case study and submit/present their work done regarding the assigned Big Data project.

Closing Remarks


Suriya Priya Asaithambi

Suria has twenty years of teaching and consulting experience in areas such as software engineering, application architecture, crafting cloud services, agile development and big data engineering. Her research interest spans around cloud computing, software engineering, test automation and big dat...

Liu Fan

Liu Fan currently lecturers in the Software Systems Practice in the areas of software engineering, big data engineering and data analytics. She received her Ph.D Ph.D. degree from School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore. Prior to joining ISS, ...

Venkat Ramanathan

Dr. Venkat Ramanathan has wide experience in the fields of IT and business process engineering. He has served industry and academia for over 26 years and has been instrumental in attracting businesses worth several millions through software consulting for clients across Asia, US, Europe and New Z...

NICF- Big Data Engineering for Analytics at NUS Institute of Systems Science

From  SGD 4 815$3,643
Add coaching to your course booking

Coaching can personalize and deepen learning for you and your organization.

Something went wrong. We're trying to fix this error.

Thank you for your application

We will contact the provider to ensure that seats are available and, if there is an admissions process, that you satisfy any requirements or prerequisites.

We may ask you for additional information.

To finalize your enrollment we will be in touch shortly.


Coursalytics is an independent platform to find, compare, and book executive courses. Coursalytics is not endorsed by, sponsored by, or otherwise affiliated with any business school or university.

Full disclaimer.

Read more about Business Analytics

What will you learn from Business Analytics courses? First of all, you will learn about the profession of a business analyst, his duties, and what such a specialist does. You will get various soft skills, such as organizing teamwork, for example, acc...

Because of COVID-19, many providers are cancelling or postponing in-person programs or providing online participation options.

We are happy to help you find a suitable online alternative.