GitHub and Docker Repos
View on GitHub
View on Docker Hub
The goal of this workshop is to use the PipelineIO platform to build an end-to-end streaming data analytics and recommendations pipeline. Each attendee works on a cloud instance (provided by us) running Docker and current streaming analytics and machine learning tools.
- First, we create a data pipeline to interactively analyze, approximate, and visualize streaming data with modern tools such as Apache Spark, Kafka, Zeppelin, IPython, Redis, Parquet, Elasticsearch, Cassandra, Presto, Flink, NiFi, and TensorFlow.
- Next, we extend our pipeline to use streaming data to generate personalized recommendation models with popular machine learning, graph, and natural language processing techniques such as collaborative filtering, clustering, and topic modeling.
- Last, we productionize our pipeline and serve live predictions and recommendations to our users.
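To make the recommendation step above more concrete, here is a minimal sketch of item-based collaborative filtering in plain Python. It is only an illustration and not the workshop's actual implementation, which is built on the tools listed above. The function names (`item_similarities`, `recommend`) and the sample ratings are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def item_similarities(ratings):
    """Cosine similarity between every pair of items that share a rater.

    ratings: dict mapping user -> {item: rating} (hypothetical input shape).
    """
    # Invert the data into item -> {user: rating} vectors.
    item_vectors = defaultdict(dict)
    for user, items in ratings.items():
        for item, score in items.items():
            item_vectors[item][user] = score

    sims = defaultdict(dict)
    names = list(item_vectors)
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            common = item_vectors[a].keys() & item_vectors[b].keys()
            if not common:
                continue  # no shared raters, so no evidence of similarity
            dot = sum(item_vectors[a][u] * item_vectors[b][u] for u in common)
            norm_a = sqrt(sum(v * v for v in item_vectors[a].values()))
            norm_b = sqrt(sum(v * v for v in item_vectors[b].values()))
            if norm_a == 0 or norm_b == 0:
                continue  # all-zero ratings carry no direction
            sims[a][b] = sims[b][a] = dot / (norm_a * norm_b)
    return sims


def recommend(ratings, user, top_n=3):
    """Rank unseen items by similarity-weighted ratings of the user's items."""
    sims = item_similarities(ratings)
    seen = ratings.get(user, {})
    scores = defaultdict(float)
    for item, score in seen.items():
        for other, sim in sims.get(item, {}).items():
            if other not in seen:
                scores[other] += sim * score
    # Highest score first; ties broken alphabetically for stable output.
    return sorted(scores, key=lambda k: (-scores[k], k))[:top_n]


# Made-up sample data for illustration only.
sample_ratings = {
    "alice": {"spark": 5, "kafka": 4},
    "bob": {"spark": 5, "kafka": 5, "flink": 4},
    "carol": {"kafka": 4, "flink": 5, "redis": 3},
}
print(recommend(sample_ratings, "alice"))
```

A production pipeline would typically compute these similarities incrementally over the stream, or use a factorization model, instead of recomputing every pair on each request.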
At the end of the workshop, the attendee can download their live, running Docker Container to their local machine as a .tar file.
Architecture Overview
- San Francisco: Saturday, April 23rd (SOLD OUT)
- San Francisco: Saturday, June 4th (SOLD OUT)
- Washington DC: Saturday, June 18th (SOLD OUT)
- Los Angeles: Sunday, July 10th (SOLD OUT)
- Seattle: Saturday, July 30th (SOLD OUT)
- Santa Clara: Saturday, August 6th (SOLD OUT)
- Chicago: Saturday, August 27th (SOLD OUT)
- New York: Saturday, October 1st (SOLD OUT)
- Munich: Saturday, October 15th (SOLD OUT)
- London: Saturday, October 22nd (SOLD OUT)
- Brussels: Saturday, October 29th (SOLD OUT)
- Madrid: Saturday, November 19th (SOLD OUT)
- Bangalore: Saturday, December 10th (SOLD OUT)
- London: Coming Soon
- Tokyo: Coming Soon
- Shanghai: Coming Soon
- Beijing: Coming Soon
- Sydney: Coming Soon
- Melbourne: Coming Soon
- Sao Paulo: Coming Soon
- Rio de Janeiro: Coming Soon
Suggest Your City!