Analytics Experts Meet in St. Louis to Discuss High Performance Data Analytics

High Performance Data Analytics

Join us at this free STLhpc.net event on October 5th in St. Louis where analytics experts from Cloudera, St. Louis-based Appistry and EMC/Greenplum will join together to present for business executives, software developers and system administrators how to enable and leverage a high performance data analytics environment. Organizations in a growing number of industries including retail, health care, bioinformatics, financial services, scientific research, transportation and others are finding competitive advantages by increasing their ability to transform unprecedented amounts of available data into business insights and discoveries.

If you are just starting to investigate high performance data analytics, however, there can be a lot to know and a lot of options. Hadoop, HBase, Pig, Hive, HDFS, Appistry’s Ayrris, EMC’s Greenplum, Cloudera’s CDH, Map Reduce, NoSQL, and so on are just some of the methods and technology options available to help you leverage cost-effective compute and storage for your high performance data analytics. The presenters will provide clarification about where these components fit, how they relate to one another, how different industries are leveraging these tools and how you can get started using them as well.

Featuring:

  • Subramanian Kartik – Global Field CTO for the Data Computing Division at EMC
  • Michael Groner – Chief Architect at Appistry
  • Sultan Meghji – Vice President of Analytics Applications at Appistry
  • Michael Katzenellenbogen – Solutions Architect at Cloudera
  • Gary Stiehr – Founder, STLhpc.net; Information Systems Group Leader at The Genome Institute at Washington University in Saint Louis


Agenda:

Start End Session Description
8:00
8:30

Check-in/Registration, Coffee and Networking

8:30
8:45

St. Louis, HPC and High Performance Data Analytics

Gary Stiehr

An introduction to the possibilities of high performance data analytics and how the St. Louis High Performance Computing (HPC) community together with the St. Louis business community is poised for success.

8:45
9:30

Deploying High Performance Analytical Pipelines

Michael Groner & Sultan Meghji

Appistry helps customers deploy high-performance analytical applications in big-data industries like intelligence, defense, life sciences, financial services, and transportation. Appistry’s Ayrris solution is delivered via automated analytical pipelines. A pipeline is a structured workflow, specifically designed to automate the analysis and management of specific types of data. Hear about the insights Appistry has gained deploying high performance analytical pipelines across industries as well as several real-world customer success stories.
9:30 10:15

Introduction to Hadoop

Michael Katzenellenbogen

This presentation is an introduction to Apache Hadoop, covering the Apache Hadoop architecture, its compute paradigm and typically applied use cases. The presenter will also discuss the Apache Hadoop ecosystem, various tools, as well how one interacts with Hadoop from/to various other systems.

10:15 10:45

Food and Networking break

10:45 11:45

The N(ot) O(nly) SQL Movement!

Subramanian Kartik

The IT landscape has been buzzing with news about the latest technological advances in high performance computing around NOSQL technologies. The promise they hold is extraordinary scale in platforms that can handle very high performance demands on commodity hardware, with deep roots in Open Source. But are they really the panacea that some claim they are? Is the world of ACID and SQL truly aging and dying?

This talk examines the place of SQL and NOSQL technologies with a slant on high performance analytics for solving Big Data problems, summing up how these technologies are being used in real life. The fundamental paradigm shift of moving Compute close to Data to enable high-throughput computing is discussed. The advantages and challenges of Hadoop, and NOSQL stores like Cassandra, MongoDB etc will be examined in the context of analytics. Recent advances in MPP relational technologies such as Greenplum, with in-database query processing and machine learning support is contrasted to NOSQL stores.

11:45
12:30

How to Immediately Begin Using and Benefiting from Apache Hadoop

Michael Katzenellenbogen

This presentation is an in-depth discussion on the typical path used to deploy a new Apache Hadoop cluster. Attendees with a range of technical skills will learn how to immediately begin using and benefiting from Apache Hadoop.

About the Speakers:

Subramanian Kartik Global Field CTO for the Data Computing Division at EMC

Subramanian Kartik is the Global Field CTO for the Data Computing Division at EMC, and an EMC Distinguished Engineer. He has been active in Information Technology for the past 15 years, having worked extensively in both academic circles and industry, designing, deploying and managing large infrastructures for Fortune 1000 customers. His experience in EMC has spanned Advanced Business Continuity architecture, enterprise solutions and emerging technologies. His current interests include Business Intelligence, Cloud Computing, Enterprise Virtualization and High Performance Computing, where he has patents pending in the area of Grid Computing on Virtual Machines. Much of his investigations recently have been around the in-database analytics and NOSQL space, where, in addition to mathematical SQL based analytics in MPP databases, he has also been studying text analytics using Map/Reduce based techniques on Hadoop, and Natural Language Processing. He holds a Masters and Doctorate in Physics from Indiana University in High Energy Physics, with over 75 publications to his name based on research at Fermilab and DESY. He received his M.S in Physics (1984) in the 5-year program at the Indian Institute of Technology in Bombay.

Michael Groner – Chief Architect at Appistry

Michael Groner is an Appistry founder and is responsible for the technical architecture and vision behind Appistry products. Prior to co-founding Appistry, Michael served in various capacities at PaylinX and, after its acquisition, at CyberSource. One of the earliest PaylinX employees, Michael held the position of Chief Architect at the company. Michael led the development of several versions of the PaylinX Payment Server. Later, as Director of Software Development at CyberSource, Michael managed the integration of the PaylinX and CyberSource products and development teams. Michael began his career as a developer with Tapestry Computing where he designed and implemented PayPort 1.x, the foundation for the PaylinX concept. Michael holds bachelor’s and master’s degrees in computer science from Washington University in St. Louis.


Sultan Meghji – Vice President of Analytics Applications at Appistry

Sultan Meghji is an accomplished technical leader and entrepreneur. He began his career nearly twenty years ago at the National Center for Supercomputing Applications (NCSA), where he developed artificial intelligence systems and first generation Internet technologies. He is a recognized leader in Enterprise Architecture, Innovation and Strategy and has served in leadership positions in IT and Enterprise Architecture at ABN AMRO, American Express, Monsanto and United Airlines. He has held numerous CIO roles in academia and the private sector in areas as diverse as IT services, financial services and biotech. In addition, Sultan is an active volunteer for the IT Entrepreneurial Network (ITEN), Infragard and other St. Louis-centric groups. Sultan holds a Bachelors degree from the University of Illinois at Urbana Champaign.


Michael Katzenellenbogen – Solutions Architect at Cloudera

As a solutions architect at Cloudera, Michael works with customers to ensure successful and productive Apache Hadoop deployments. Prior to joining Cloudera, Michael held lead technical roles with a number of prominent NYC-based Fortune 100 companies. Michael has over 10 years experience in software development, systems architecture and data analytics.


Gary Stiehr – Founder, STLhpc.net; Information Systems Group Leader at The Genome Institute at Washington University in Saint Louis

Gary has over eleven years of experience in High Performance Computing (HPC) environments. He leads the Information Systems group at The Genome Institute at Washington University in St. Louis, which manages the HPC systems used to analyze the genetic basis of cancer and other diseases. Also, he has recently started STLhpc.net, which has quickly become the focal point for HPC activity in St. Louis and aims to contribute to growth in St. Louis through the application of High Performance Computing. Prior to joining The Genome Institute, Gary worked in the Computing Division at Fermi National Accelerator Laboratory (Fermilab) as part of the U.S. CMS Tier-1 Facilities team building and supporting the Tier-1 regional computing center as part of the Worldwide Large Hadron Collider Computing Grid. Prior to Fermilab, Gary led the development and adoption of high performance computing resources by University research groups at the University of Missouri – St. Louis.

Our Sponsors:

Thank you to EMC and Savvis for sponsoring the room and food for this STLhpc.net event.

This entry was posted in Events and tagged . Bookmark the permalink.

2 Responses to Analytics Experts Meet in St. Louis to Discuss High Performance Data Analytics

  1. Pingback: High Performance Analytics Experts to Meet in St. Louis Oct. 5 | insideHPC.com

  2. Pingback: High Performance Analytics Experts to Meet in St. Louis Oct. 5 | Inside-BigData.com

Comments are closed.