Навыки, которые вы получите:
Social network analysis
BigData
Google ads
Analytics
MapReduce
Learn how big data is driving organisational change and essential analytical tools and techniques, including data mining and PageRank algorithms.
The Program
- The basics of working with big data. Understand the four V’s of Big Data (Volume, Velocity, and Variety); Build models for data; Understand the occurrence of rare events in random data.
- Web and social networks. Understand characteristics of the web and social networks; Model social networks; Apply algorithms for community detection in networks.
- Clustering big data. Clustering social networks; Apply hierarchical clustering; Apply k-means clustering.
- Google web search. Understand the concept of PageRank; Implement the basic; PageRank algorithm for strongly connected graphs; Implement PageRank with taxation for graphs that are not strongly connected.
- Parallel and distributed computing using Map. ReduceUnderstand the architecture for massive distributed and parallel computing; Apply MapReduce using Hadoop; Compute PageRank using MapReduce.
- Computing similar documents in big data. Measure importance of words in a collection of documents; Measure similarity of sets and documents; Apply local sensitivity hashing to compute similar documents.
- Products frequently bought together in stores. Understand the importance of frequent item sets; Design association rules; Implement the A-priori algorithm.
- Movie and music recommendations. Understand the differences of recommendation systems; Design content-based recommendation systems; Design collaborative filtering recommendation systems.
- Google’s AdWordsTM System. Understand the AdWords System; Analyse online algorithms in terms of competitive ratio; Use online matching to solve the AdWords problem.
- Mining rapidly arriving data streams. Understand types of queries for data streams; Analyse sampling methods for data streams; Count distinct elements in data streams; Filter data streams.
What you’ll learn
- Knowledge and application of MapReduce.
- Understanding the rate of occurrences of events in big data.
- How to design algorithms for stream processing and counting of frequent elements in Big Data.
- Understand and design PageRank algorithms.
- Understand underlying random walk algorithms.