Powered by OpenAIRE graph

Neo Technology UK (Neo4J)

Neo Technology UK (Neo4J)

2 Projects, page 1 of 1
  • Funder: UK Research and Innovation Project Code: EP/M025268/1
    Funder Contribution: 4,557,640 GBP

    Data is everywhere, generated by increasing numbers of applications, devices and users, with few or no guarantees on the format, semantics, and quality. The economic potential of data-driven innovation is enormous, estimated to reach as much as £40B in 2017, by the Centre for Economics and Business Research. To realise this potential, and to provide meaningful data analyses, data scientists must first spend a significant portion of their time (estimated as 50% to 80%) on "data wrangling" - the process of collection, reorganising, and cleaning data. This heavy toll is due to what is referred as the four V's of big data: Volume - the scale of the data, Velocity - speed of change, Variety - different forms of data, and Veracity - uncertainty of data. There is an urgent need to provide data scientists with a new generation of tools that will unlock the potential of data assets and significantly reduce the data wrangling component. As many traditional tools are no longer applicable in the 4 V's environment, a radical paradigm shift is required. The proposal aims at achieving this paradigm shift by adding value to data, by handling data management tasks in an environment that is fully aware of data and user contexts, and by closely integrating key data management tasks in a way not yet attempted, but desperately needed by many innovative companies in today's data-driven economy. The VADA research programme will define principles and solutions for Value Added Data Systems, which support users in discovering, extracting, integrating, accessing and interpreting the data of relevance to their questions. In so doing, it uses the context of the user, e.g., requirements in terms of the trade-off between completeness and correctness, and the data context, e.g., its availability, cost, provenance and quality. The user context characterises not only what data is relevant, but also the properties it must exhibit to be fit for purpose. Adding value to data then involves the best effort provision of data to users, along with comprehensive information on the quality and origin of the data provided. Users can provide feedback on the results obtained, enabling changes to all data management tasks, and thus a continuous improvement in the user experience. Establishing the principles behind Value Added Data Systems requires a revolutionary approach to data management, informed by interlinked research in data extraction, data integration, data quality, provenance, query answering, and reasoning. This will enable each of these areas to benefit from synergies with the others. Research has developed focused results within such sub-disciplines; VADA develops these specialisms in ways that both transform the techniques within the sub-disciplines and enable the development of architectures that bring them together to add value to data. The commercial importance of the research area has been widely recognised. The VADA programme brings together university researchers with commercial partners who are in desperate need of a new generation of data management tools. They will be contributing to the programme by funding research staff and students, providing substantial amounts of staff time for research collaborations, supporting internships, hosting visitors, contributing challenging real-life case studies, sharing experiences, and participating in technical meetings. These partners are both developers of data management technologies (LogicBlox, Microsoft, Neo) and data user organisations in healthcare (The Christie), e-commerce (LambdaTek, PricePanda), finance (AllianceBernstein), social networks (Facebook), security (Horus), smart cities (FutureEverything), and telecommunications (Huawei).

    more_vert
  • Funder: UK Research and Innovation Project Code: EP/L015358/1
    Funder Contribution: 3,523,120 GBP

    Cloud computing offers the ability to acquire vast, scalable computing resources on-demand. It is revolutionising the way in which data is stored and analysed. The dynamic, scalable approach to analysis offered by cloud computing has become important due to the growth of "big data": the large, often complex, datasets now being created in almost all fields of activity, from healthcare to e-commerce. Unfortunately, due to a lack of expertise, the full potential of cloud computing for extracting knowledge from big data has rarely been achieved outside a few large companies; as a result, many organisations fail to realize their potential to be transformed through extracting more value from the data available to them. UK industry faces a huge skills gap in this area as the demand for big data staff has risen exponentially (912%) over the past five years from 400 advertised vacancies in 2007 to almost 4,000 in 2012 (e-skills UK, Jan 2013). In addition, the demand for big data skills will continue to outpace the demand for standard IT skills, with big data vacancies forecast to increase by around 18% per annum in comparison with 2.5% for IT. Over the next five years this equates to a 92% rise in the demand for big data skills with around 132K new jobs being created in the UK (e-skills UK, Jan 2013). While characteristics such as size, data dependency and the nature of business activity will affect the potential for organisations to realise business benefits from big data, organisations don't have to be big to have big data issues. The problems and benefits are as true for many SMEs as they are for big business which, inevitably broadens and increases the demand for cloud and big data skills. Further, even when security concerns prevent the use of external "public" clouds for certain types of data, organisations are applying the same approaches to their own internal IT resources, using virtualisation to create "private" clouds for data analysis. Addressing these challenges requires expert practitioners who can bridge between the design of scalable algorithms, and the underlying theory in the modelling and analysis of data. It is perhaps not surprising that these skills are in short supply: traditional undergraduate and postgraduate courses produce experts in one or the other of these areas, but not both. We therefore propose to create a multi-disciplinary CDT to fill this significant gap. It will produce multi-disciplinary experts in the mathematics, statistics and computing science of extracting knowledge from big data, with practical experience in exploiting this knowledge to solve problems across a range of application domains. Based on a close collaboration between the School of Computing Science and the School of Mathematics and Statistics at Newcastle University, the CDT will address market requirements and overcome the existing skills barriers. The student intake will be drawn from graduates in computing science, mathematics and statistics. Initial training will provide the core competencies that the students will require, before they collaborate in group projects that teach them to address real research challenges drawn from application domains, before moving on to their individual PhD topic. The PhD topics will be designed to allow the students to focus deeply on a real-world problem the solution of which requires an advance in the underlying computing, maths and statistics. To reinforce this focus, they will spend time on a placement hosted by an industrial or applied academic partner facing that problem. Their PhD research will therefore deepen their knowledge of the field and teach them how to exploit it to solve challenging problems. Working in the new, custom-designed Cloud Innovation Centre, the students will derive continuous benefit from being co-located with researchers, industry experts, and their fellow students; immersing them in a group with a wide range of skills, knowledge and experiences.

    more_vert

Do the share buttons not appear? Please make sure, any blocking addon is disabled, and then reload the page.

Content report
No reports available
Funder report
No option selected
arrow_drop_down

Do you wish to download a CSV file? Note that this process may take a while.

There was an error in csv downloading. Please try again later.