Skip to main content
USENIX
  • Conferences
  • Students
Sign in
  • FAST '14 Home
  • Conference Organizers
  • Registration Information
    • Registration Discounts
    • Venue, Hotel, and Travel
  • At a Glance
  • Calendar
  • Training Program
  • Technical Sessions
    • WiPs
  • Activities
    • Poster Sessions
    • Birds-of-a-Feather Sessions
  • Sponsorship
  • Students and Grants
  • Services
  • Questions?
  • Help Promote!
  • For Participants
  • Call for Papers
  • Past Conferences

sponsors

Platinum Sponsor
Gold Sponsor
Gold Sponsor
Gold Sponsor
Gold Sponsor
Gold Sponsor
Silver Sponsor
Bronze Sponsor
Bronze Sponsor
Bronze Sponsor
Bronze Sponsor
Bronze Sponsor
General Sponsor
General Sponsor
General Sponsor
General Sponsor
General Sponsor
General Sponsor
General Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Industry Partner
Industry Partner

twitter

Tweets by @usenix

usenix conference policies

  • Event Code of Conduct
  • Conference Network Policy
  • Statement on Environmental Responsibility Policy

You are here

Home ยป Introduction to Apache Hadoop and Its Ecosystem
Tweet

connect with us

http://twitter.com/usenix
https://www.facebook.com/pages/USENIX-Association/124487434386
http://www.linkedin.com/groups/USENIX-Association-49559/about
https://plus.google.com/108588319090208187909/posts
http://www.youtube.com/user/USENIXAssociation

Introduction to Apache Hadoop and Its Ecosystem

Half Day Morning
(9:00 am-12:30 pm)

Ballroom A

M1
Mark Grover, Cloudera, Inc.
Description: 

Originally inspired by Google's GFS and MapReduce papers, Apache Hadoop is an open source framework offering scalable, distributed, fault-tolerant data storage and processing on standard hardware. This session explains what Hadoop is and where it best fits into the modern data center. You'll learn the basics of how it offers scalable data storage and processing, some important "ecosystem" tools that complement Hadoop's capabilities, and several practical ways organizations are using these tools today. Additionally, you'll learn about the basic architecture of a Hadoop cluster and some recent developments that will further improve Hadoop's scalability and performance.

Who should attend: 

This session is intended for those who are new to Hadoop and are seeking to understand what Hadoop is, the ways that organizations are using it, and how it compares to and integrates with other systems. It assumes no prior knowledge of Hadoop, and explanations of technical topics like MapReduce and HDFS replication are clear and concise, making it appropriate for anyone attending the conference.

Topics include: 
  • What Hadoop is and how organizations are using it
  • How the HDFS filesystem provides reliability and high throughput
  • How MapReduce enables parallel processing on large data sets
  • Explanations of some popular open source tools that integrate with Hadoop
  • Typical architecture of a Hadoop cluster
  • Considerations for hosting a Hadoop cluster
  • Emerging trends in the design and implementation of Hadoop

Platinum Sponsors

Gold Sponsors

Silver Sponsors

Bronze Sponsors

General Sponsors

Media Sponsors & Industry Partners

© USENIX

  • Privacy Policy
  • Contact Us