Skip to content

gauravparashar/symbiosis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cloud and Big Data

Summary:

This is a repository of the code to be executed in the class.

Course Content:

SNo. Topic Classroom Hrs Lab Hrs
1 Big Data Overview 2 0
2 Big Data Analytics in Industry Verticals 4 0
3 Analytics for Unstructured Data 4 8
4 The Hadoop Ecosystem 2 8
5 Principles of Cloud Computing 2 0
 6 Hybrid Cloud Management 3 2
7 Cloud Web Services 4 6
 Total 21 24

Reference Books

https://www.oreilly.com/programming/free/files/a-whirlwind-tour-of-python.pdf

For Visualizations we will use Anaconda

Download anaconda for Python 3.7 and choose your own distribution from here: https://www.anaconda.com/distribution/

Datasets:

Source: India State of Forest Report, 2011; **Land Use Statistics, Ministry of Agriculture, GOI, 2008-09; Area is in thousands Hectares (ha)

https://data.gov.in/resources/forest-cover-change-matrix-himachal-pradesh-2013

Details on District-wise forest cover for States/Uts. Forest Cover refers to all lands more than one hectare in area, with a tree canopy density of more than 10 percent irrespective of ownership and legal status. Such lands may not necessarily be a recorded forest area. It also includes orchards, bamboo and palm.

https://data.gov.in/resources/land-use-pattern-uttar-pradesh

Dataset for "Statistics and Social Network of YouTube Videos"

Column No. Name Description
1. video ID an 11-digit string, which is unique
2. uploader a string of the video uploader's username
3. age an integer number of days between the date when the video was uploaded and Feb.15, 2007 (YouTube's establishment)
4. category a string of the video category chosen by the uploader
5. length an integer number of the video length
6. views an integer number of the views
7. rate a float number of the video rate
8. ratings an integer number of the ratings
9. comments an integer number of the comments
10. related IDs up to 20 strings of the related video IDs

https://netsg.cs.sfu.ca/youtubedata/

Home Work Question 1:Telecom

https://www.tatateleservices.com/downloads/WhitePapers/resources/Big-Data-and-the-Telecom-Industry.pdf

Cloud Computing

  1. https://www.researchgate.net/publication/304418741_Application_of_cloud_computing_services_in_business

  2. https://event.on24.com/eventRegistration/console/EventConsoleApollo.jsp?&eventid=1638863&sessionid=1&username=&partnerref=&format=fhaudio&mobile=&flashsupportedmobiledevice=&helpcenter=&key=20E0FA32E42181D81EF3C47089ADB0AF&newConsole=false&nxChe=false&text_language_id=en&playerwidth=748&playerheight=526&eventuserid=275170966&contenttype=A&mediametricsessionid=230456274&mediametricid=2343496&usercd=275170966&mode=launch

  3. https://www.scribd.com/document/50992268/Kyne-Tix-Cloud-Computing-Strategy-Guide

  4. https://www.icmrindia.org/casestudies/catalogue/IT%20and%20Systems/ITSY077.htm

  5. https://practice.geeksforgeeks.org/problems/difference-between-cloud-and-big-data

About

Source Code for Cloud and Big Data course

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published