Skip to main content

The Linux Foundation, ODPi and edX Announce New, Free Intro to Apache Hadoop Course

By 2017-03-308月 22nd, 2017Press Release

Massive Open Online Course (MOOC) to provide students with basic knowledge of the leading Big Data processing software

SAN FRANCISCO – March 30, 2017 – The Linux Foundation, the nonprofit advancing professional open source management for mass collaboration, today announced its newest massive open online course (MOOC) is available for registration. The course, LFS103x – Introduction to Apache Hadoop, is offered through edX, the nonprofit online learning platform launched in 2012 by Harvard University and Massachusetts Institute of Technology (MIT). This free course will begin in early June.

This is the fifth edX MOOC offered by The Linux Foundation. Its first course, Intro to Linux, has reached more than 800,000 students globally and continues to grow in registrations. The others are Intro to Cloud Infrastructure Technologies, Introduction to OpenStack and Introduction to DevOps: Transforming and Improving Operations.

Apache Hadoop is an open source project used for distributed processing of large sets of data. It is used by organizations large and small around the world to manage and analyze the massive amounts of data being created every single second of every day. A large number of additional open source software projects exist that can be installed and run on top of or alongside it to provide additional functionality. According to 451 Research, Hadoop will advance at a 38 percent compound annual growth rate (CAGR) through 2020 and reach $4.4 billion in revenue by 2020.

At the same time, the demand for individuals who have experience managing this platform is also accelerating. According to the IT Skills and Certifications Pay Index research from Foote Partners, “the need for big data skills also continues to lead to pay increases – about 8 percent over the last year,” making this an ideal time for individuals to start a career managing Big Data with Apache Hadoop.

“As innovation across the Hadoop landscape continues to skyrocket, we’re thrilled to provide accessible, vendor-neutral education for the Big Data community,” said ODPi’s Director, John Mertic. “ODPi is committed to reducing ecosystem complexity and, with Roman Shaposhnik leading this ‘Introduction to Apache Hadoop’ edX course, we look forward to sharing insights that make Hadoop manageable for organizations of all sizes.”

LFS103x is taught by Hadoop experts from The Linux Foundation’s ODPi project, which is committed to simplification and standardization of the big data ecosystem with common reference specifications and test suites. Shaposhnik, VP of Technology for ODPi at The Linux Foundation and the course instructor, is also a committer on Apache Hadoop, co-creator of Apache Bigtop, and contributor to various other Hadoop ecosystem projects. He is also an Apache Software Foundation member and a former Chair of Apache Incubator.

Students in the course will learn:

  • The origins of Apache Hadoop and its big data ecosystem

  • Deploying Hadoop in a clustered environment of a modern day enterprise IT

  • Building data lake management architectures around Apache Hadoop

  • Leveraging the YARN framework to effectively enable heterogeneous  analytical workloads on Hadoop clusters

  • Leveraging Apache Hive for an SQL-centric view into the enterprise data lake

  • An introduction to managing key Hadoop components (HDFS, YARN and Hive) from the command line

  • Securing and scaling your data lakes in multi-tenant enterprise environments

“In today’s high-tech world, more data is created every day and increasingly organizations need professionals qualified to analyze it,” said edX CEO and MIT Professor Anant Agarwal. “We are pleased to again partner with The Linux Foundation to increase access to in-demand education, helping to bring Apache Hadoop expertise into the Big Data industry.”

The course includes six chapters, each with a short graded quiz at the end. A final exam is also required in order to complete the course. Students may take the complete course at no cost, or add a verified certificate of completion for $99.

For more information on The Linux Foundation’s training and certification programs, please visit:

About The Linux Foundation

The Linux Foundation is the organization of choice for the world’s top developers and companies to build ecosystems that accelerate open technology development and commercial adoption. Together with the worldwide open source community, it is solving the hardest technology problems by creating the largest shared technology investment in history. Founded in 2000, The Linux Foundation today provides tools, training and events to scale any open source project, which together deliver an economic impact not achievable by any one company. More information can be found at

# # #

The Linux Foundation has registered trademarks and uses trademarks. For a list of trademarks of The Linux Foundation, please see our trademark usage page:

Linux is a registered trademark of Linus Torvalds.

Apache, Bigtop and Hadoop are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.

The Linux Foundation
Follow Us