Assessment Title : Individual Project Development Brunel University London
MAIN OBJECTIVE OF THE ASSESSMENT In this assessment, you are required to demonstrate the appropriate practical skills and abilities to implement solutions using modern large-scale data storage and processing infrastructures, and to critically reflect on the concepts, theory and use of high performance computational infrastructures.
DESCRIPTION OF THE ASSESSMENT You are required to identify and analyse a real-world problem, design and implement a solution to the problem using Hadoop, and evaluate your implementation. The problem can be a simplified version from its original scale, extent or level of difficulties etc. An indicative list of sample problems have been provided at the end of this document. You may choose one of the problems in the list, but you are encouraged to identify your own problem for the project.
The assessment has two weighted components:
Learning Outcomes:
LO1: Demonstrate the appropriate practical skills/abilities required to implement solutions using modern large-scale data storage and processing infrastructures.
LO2: Reflect critically on the concepts, theory and appropriate use of large-scale data storage and processing infrastructures (commonly used in modern organisational environments).
Marking Criteria:
The coursework will be marked for 4 main criteria:
Grade Band E and F (E+, E, E-, F) The candidate fails to meet the minimum requirements as outlined in the learning outcomes.
Grade Band D (D+, D, D-) The work demonstrates significant weaknesses, but all of the learning outcomes have been met at the minimum requirement level. The work provides evidence of some critical understanding of the concepts and theories of large-scale data storage and processing infrastructures, and demonstrates some abilities and skills to implement solutions using these technologies.
Grade Band C (C+, C, C-) In addition to the requirements for a grade in D-band, the work demonstrates a critical and substantial understanding of the concepts and theories of large-scale data storage and processing infrastructures. It demonstrates the ability to develop an independent, systematic, logical and effective solution to the problems identified. It also demonstrates a significant degree of competence in the appropriate use of the relevant literature, theory, methodologies, practices, and tools, etc., to analyse the problems and evaluate the solutions.
Grade Band B (B+, B, B-) In addition to the requirements for a grade in C-band, the work clearly demonstrates a well-developed, critical and substantial understanding of the concepts and theories of large-scale data storage and processing infrastructures. It clearly demonstrates the ability to develop an independent, systematic, logical and effective solution to the problems identified. It also demonstrates a high degree of competence in the appropriate use of the relevant literature, theory, methodologies, practices, and tools, etc., to analyse the problems and evaluate the solutions.
Grade Band A (A*, A+, A, A-) In addition to the requirements for a grade in B-band, the work clearly demonstrates a sophisticated, critical and thorough understanding of the concepts and theories of large-scale data storage and processing infrastructures. It provides evidence of originality of thought and clearly demonstrates the ability to develop an independent, systematic, logical and effective solution to the problems identified. It also demonstrates excellence in the appropriate use of the relevant literature, theory, methodologies, practices, and tools, etc., to analyse the problems and evaluate the solutions.
FORMAT OF THE ASSESSMENT
There is no word/page limit for this assessment, but the best effort should be made to ensure the submission is as concise as possible. You should include sections on (percentage of overall mark):
In this assessment, you are required to identify and analyse a real-world problem, design and implement a solution to the problem using Hadoop, and evaluate your implementation. The problem can be a simplified version from its original scale, extent or level of difficulties etc. Please refer to the official assessment specifications for the objectives, descriptions, marking criteria, format and submission requirement of the coursework.
An indicative list of sample problems is given as below. You may choose one of the problems in the list, but it would be better, and you are most encouraged, to identify your own problem and provide your own solutions for the coursework.
1. Word counting. We have used word counting as a "Hello World" problem in the lectures and labs. However, there is still space to extend the problem, for example, the dealing of upper/lower cases, punctuation marks, top N frequent words, cooccurrence words etc.
2. Scientific data analysis. We have used the UK weather data as an example in the lab. You may extend this application by developing tools to provide more in-depth analysis of weather and climate.
3. Image conversion. The story of converting millions of image documents from TIFF to PDF by the New York Times has been a highlight of Hadoop. Technically, it is not a very complicated task. Perhaps you will have a try.
4. Network traffic analysis. Take a web server log file, and write a program to analyse the traffic to the server, for example, the number of visit for each IP address per unit time, the top N visitors etc, for a starting point.
5. Monte Carlo simulation. Estimation of pi (the ratio of a circle's circumference to its diameter) using the Monte Carlo method.
6. Social Media Analytics. Pete Warden’s infamous story with Facebook has caught the eye of people from both within and outside the data science community. Regardless your opinion on this particular case, the power of big data technologies has been demonstrated to a great extent. You do not have to get yourself into the troubles like Pete Warden did, but certainly you can use the legitimate approaches to explore the hidden values of social media such as capturing consumer attitudes, managing online reputation, anticipating customer needs and making recommendations, etc.
(Presentation - 20%) Introduction (10%)
Brief presentation of distributed methods to manage and analyse data and how they differ from traditional approaches
The Problem and Associated Dataset (15%)
Design & Implementation (20%)
Results (20%)
Conclusions (15%)
The assignment I got was complicated enough and it was hard for someone else to do it. Our professor had explained a proper technique and format to do it, I was worried if urgenthomework could do it or not? I had a talk with their customer care and they gave me the contact details of the expert who would do my work. I told them the procedure and I was surprised by the product delivery. It was an excellent work framed in the style and Format I wanted it.
I had a critical task accommodation due date. One of my companions recommended that I should hire the services of urgenthomework.com. When I put in the task request, they quickly acknowledged it and comprehended the earnestness and urgency of time. they conveyed my task until before a day of submission date. Services are as good as your writing is. And quite affordable as well. Wish you good luck for the future. Keep growing.
I have been using this website since last many years. It helps me in my college project and homework. Excellent study materials are provided which is easy to understand and learn. Read More
Follow Us