Table A - Data from the IMDB website.
Table B - Data from the TheNumbers website.
Table M - All matches between Tables A and B, each identified by ID and title.
Table E - The final integrated table, built from the matches in M.
- The number of tuples is 4291.
- The attributes, which store movie information, are:
  - the title of the movie,
  - the year of release,
  - mpaa, the rating assigned by the Motion Picture Association of America,
  - the length of the movie,
  - the genre it belongs to,
  - the star rating on IMDB,
  - the metascore rating, which reflects critics' reviews,
  - a short description of the movie,
  - the director of the film,
  - the stars, i.e. the leading actors in the movie, and
  - the total worldwide earnings.
- The number of tuples is 31006.
- The attributes store the same information as in imdb2.csv, except that some columns, such as stars (i.e. actors) and description, have missing values.
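As a rough illustration of how an integrated table like E could be assembled from the match pairs in M, here is a minimal sketch in plain Python. The row layout (dicts with an "id" key), the column names, the sample values, and the merge rule (keep the value from A and fall back to B when A's value is empty) are all assumptions made for this example; they are not taken from the actual data files or from the method used to build E.

```python
# Hypothetical layout: each row is a dict with an "id" key plus movie attributes.
# An empty string marks a missing value (an assumption for this sketch).

def integrate(table_a, table_b, matches):
    """Merge matched rows, preferring A's value and filling gaps from B."""
    a_by_id = {row["id"]: row for row in table_a}
    b_by_id = {row["id"]: row for row in table_b}
    integrated = []
    for a_id, b_id in matches:
        a_row, b_row = a_by_id[a_id], b_by_id[b_id]
        # Keep A's column order, then append columns that only B has.
        columns = [c for c in a_row if c != "id"]
        columns += [c for c in b_row if c != "id" and c not in a_row]
        merged = {}
        for col in columns:
            a_val = a_row.get(col, "")
            merged[col] = a_val if a_val != "" else b_row.get(col, "")
        integrated.append(merged)
    return integrated


# Made-up sample rows, not real records from either website.
table_a = [{"id": "a1", "title": "Example Movie", "year": "2001",
            "stars": "Actor One, Actor Two", "gross": ""}]
table_b = [{"id": "b9", "title": "Example Movie", "year": "2001",
            "stars": "", "gross": "1000000"}]
table_m = [("a1", "b9")]  # one matched (A id, B id) pair

table_e = integrate(table_a, table_b, table_m)
print(table_e)
```

In this sketch the missing stars value in the B row is covered by the A row, and the missing gross value in the A row is covered by the B row, so the merged row has no gaps. A different conflict rule (for example, preferring B for earnings) would only change the inner loop.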