O’Reilly Data Show: Data lake and connecting dots
Contents
The following are the learnings from the podcast.
- Data lake dream
- Make data usable
- Building an API that helps the user consume data
- Don’t build any data without making it available to the whole organisation
- Cloudera - Espouses data lake
- Need to get the search strategy right
- Hadoop is similar to Linux
- Free Linux - quick iteration, scale up, go mobile
- Following Spark
Connecting dots
- Spatio-temporal patterns
- Used to study sports