Lab 5 – imdb's most frequent co-stars

2 pages - 188.5 KB
Of movies that each pair of actors/actresses have co-starred in. the output should look something like: de niro, robert##pacino, al.
Document in text mode:
Lab5–IMDB’sMostFrequentCo-starsCC5212-1April9,2015Nomorecountingwords!Todaywecountthestars.Morespecifically,wewillcountco-starsinmoviesthatarelistedinIMDb.1Yourmission...shouldyouchoosetoacceptit...istocreateasetofMapReducejobsthatcountthepairsofactorsoractressesthathaveco-starredinthemostmoviestogether.Hopefullyitshouldn’tbeimpossible.ThegoalofthelabistodemonstratethatyoucannowindependentlycodeMapReducejobstoprocesslargeamountsofdata.SincewedidsomethingverysimilarontheboardinthelectureonMondayandsincelastWednesday,wealreadysawthesyntaxandstepsneededtocodeandrunaMapReducejob,inthislabI’mnotgoingtogiveyoudetailedstep-by-stepinstructions.Instead,I’llgiveyoutheinstructionstogetyoustartedandwillpointyoutothematerialthatwillhelpyoucompletethelab.Therestwillbeuptoyou.•Theinstructionsforloggingintotheserver,foraccessinganduploadingdatatoHDFS,forbuildingyourcodeandrunningit,etc.areavailablefromlastweeks’instructions:http://aidanhogan.com/teaching/cc5212-1/doc/lab4.pdf.•ThedatayouneedareonHDFSin...