Key Points |
1) data manipulation Databases and the relational algebraMapReduce, Hadoop, relationship to databases, algorithms, extensions, language; key-value stores and NoSQL; tradeoffs of SQL and NoSQLData cleaning, entity resolution, data integration, information extraction 2) analytics Basic statistical modeling, experiment designIntroduction to Machine Learning, supervised learning, decision trees/forests, simple nearest neighborUnsupervised learning: k-means, multi-dimensional scaling 3) interpreting and communicating results Visualization, visual data analyticsEthics, privacy | |
Further Links | https://class.coursera.org/datasci-001/lecture/preview | |
Related Visuals | http://blogs.gartner.com/doug-laney/deja-vvvue-others-claiming-gartners-volume-velocity-variety-construct-for-big-data/ | |
Webinar Category | Data Science | |
Title | Introduction to Data Science | |
Presenter_Name | Howe, Bill | |
Provider | Coursera | |
Partner | University of Washington | |
Runtime |
Comments
i like your blog very much Data Science Online course
ReplyDelete