Beautiful Data
From WikiContent
The idea here is to get a book of industry experts in various aspects of Data acquisition, manipulation, streaming, serving, visualizing, displaying, protecting, understanding and show how each specific aspect has a beautiful nature to it. I would love to see an Appendix of all the great data sources that folks can acquire and use. The Appendix could be built through wiki-based URL submissions.
Beautiful Data
Clarity of vision in a world of increasing data volume and complexity. Novel perspectives, new capabilities. Increased understanding and efficiency. Expanding the scientific method into more extreme conditions.
Motivations and Background
Break data analysis into stages:
- collection
- storage
- organization
- retrieval
- visualization
- analysis
Recent books:
- The Numerati by Stephen Baker
- Competing on Analytics: The New Science of Winning by Thomas H. Davenport and Jeanne G. Harris
- Programming Collective Intelligence by Toby Segaran
Open source projects:
- hadoop
- hbase
- cassandra
- kfs
- lustre
- sge
- hypertable
Open data projects
- theinfo.org
- infochimps
