1. Title: Data-Driven Genomic Computing: Making Sense of the Signals from the Genome

    Keynote Lecturer: Stefano Ceri
    Presented on: 24/07/2017, Madrid, Spain

    Abstract: Genomic computing is a new science focused on understanding the functioning of the genome, as a premise to fundamental discoveries in biology and medicine. Next Generation Sequencing (NGS) allows the production of the entire human genome sequence at a cost of about 1000 US; many algorithms exist for the extraction of genome features, or "signals", including peaks (enriched regions), mutations, or gene expression (intensity of transcription activity). The missing gap is a system supporting data integration and exploration, giving a “biological meaning” to all the available information; such a system can be used, e.g., for better understanding cancer or how environment influences cancer development.The GeCo Project (Data-Driven Genomic Computing, ERC Advanced Grant, 2016-2021) has the objective or revisiting genomic computing through the lens of basic data management, through models, languages, and instruments, focusing on genomic data integration. Starting from an abstract model, we developed a system that can be used to query processed data produced by several large Genomic Consortia, including Encode and TCGA; the system employs internally the Spark engine, and prototypes can already be accessed from Cineca or from PoliMi servers. During the five-years of the ERC project, the system will be enriched with data analysis tools and environments and will be made increasingly efficient.Among the objectives of the project, the creation of an “open source” repository of public data, available to biological and clinical research through queries, web services and search interfaces.

    Event's Websites: icsoft.org/
    dataconference.org/

    Presented at the following Events:
    ICSOFT, 12th International Conference on Software Technologies
    DATA, 6th International Conference on Data Science, Technology and Applications

    # vimeo.com/239084173 Uploaded 6 Plays 0 Comments
  2. The European Project Space panel held in Lisbon on Monday 25th entitled “The Role of Software Technology in the Fourth Industrial Revolution” was chaired by Ricardo J. Machado from Universidade do Minho, Portugal.
    The present panelists that have related their experiences working in the context of different European projects were:
    - João Mil-Homens, Horizon 2020 National Contact Point - ICT, SME, FTI - Fundação para a Ciência e Tecnologia | Agência Nacional Inovação, Portugal
    - Christoph Quix, RWTH Aachen University, Germany
    - Francisco Almada Lobo, CRITICAL Manufacturing, Portugal
    - Pedro Vaz Silva, Bosch Car Multimedia, Portugal

    # vimeo.com/180012126 Uploaded 8 Plays 0 Comments
  3. Keynote Title: Schema Evolution for Relational Databases
    Keynote Lecturer: Panos Vassiliadis
    Presented on: 25/07/2016, Lisbon, Portugal
    Abstract: Like all software systems, databases are subject to evolution as time passes. The impact of this evolution is tremendous as every change to the schema of a database affects the syntactic correctness and the semantic validity of all the surrounding applications and in fact necessitates their maintenance in order to remove errors from their source code. The talk will provide a walk-through on the current state of knowledge on the mechanics of schema evolution for relational databases. The main lessons learned from the existing case studies will be discussed; moreover, recent findings on frequent patterns of change will also be presented. Open issues for further research will be discussed at the end of the talk.
    Conference Website: dataconference.org/
    Presented at the following Conference: DATA, 5th International Conference on Data Management Technologies and Applications

    # vimeo.com/178350220 Uploaded 52 Plays 0 Comments
  4. Keynote Title: Data Lakes: A Solution or a new Challenge for Big Data Integration?
    Keynote Lecturer: Christoph Quix
    Presented on: 26/07/2016, Lisbon, Portugal
    Abstract: “Data Lake” is a new concept that has been introduced in the Big Data field to address the problem of the integration of heterogeneous information. Silos of isolated information should be avoided by loading the data into a coherent data repository. In contrast to classical ETL processes as in data warehouse systems, the transformation step is skipped and data is loaded in its original structure to avoid upfront integration efforts and to make all source data available for later data analysis tasks. The transformation is done at a later phase in which the target application is more clear and a more powerful data processing framework (e.g., Hadoop) is available. Although the idea of a data lake seems to be an attractive solution to make Big Data integration more efficient, the original problems of data integration are not resolved. Data is still very heterogeneous in its structure, semantics, and quality. In order to avoid that the data lake turns into a data swamp, we propose a metadata-driven and quality-oriented approach for data lake management. Components for automatic metadata extraction and enrichment, semantic annotations, and quality monitoring are key elements of our architecture. The talk will give an overview of the state-of-the-art and the state-of-practice in data lakes and point out the challenges for future research.
    Conference Website: dataconference.org/
    Presented at the following Conference: DATA, 5th International Conference on Data Management Technologies and Applications

    # vimeo.com/178329017 Uploaded 103 Plays 0 Comments
  5. Keynote Title: The Provenance of Consumer and Social Media Data

    Keynote Lecturer: Paul Longley

    Presented on: 20/07/2015, Colmar, Alsace, France

    Abstract: This presentation reports on the research activities of the Consumer Data Research Centre (CDRC), which is one of the UK’s current ‘Big Data’ investments funded by the Economic and Social Research Council (ESRC). Established in 2014, the CDRC’s mission is to bring sharper focus to the deployment and use of business and social media data, in support of decision-making across a widening spectrum of applications. After describing the three tier service structure of the CDRC, this presentation sets out the range of applications that are under development, the researcher and user interfaces that have been devised, and the ways in which business data may be evaluated and linked to conventional social survey sources. The presentation then focuses upon issues of establishing the provenance of business and social media data, and the wider implications of Big Data for the practice of social science. It also discusses some practical ways in which the value of new data sources may be reliably assessed.These ideas are illustrated using an extended case study of the use of Twitter geo-temporal demographics to understand the activity patterns of different ethnic groups in London. These patterns are linked to the geography of residence as depicted using conventional data sources such as the UK Census of Population.

    Presented at the following Conference:
    - DATA, 4th International Conference on Data Management Technologies and Applications

    Conference Website:
    - dataconference.org/

    # vimeo.com/137812176 Uploaded 56 Plays 0 Comments

DATA

INSTICC PRO

International Conference on Data Management Technologies and Applications (DATA) aims to bring together researchers, engineers and practitioners interested on databases, data warehousing, data mining, data management, data security and other aspects of…


+ More

International Conference on Data Management Technologies and Applications (DATA) aims to bring together researchers, engineers and practitioners interested on databases, data warehousing, data mining, data management, data security and other aspects of information systems and technology involving advanced applications of data.

Browse This Channel

Shout Box

Heads up: the shoutbox will be retiring soon. It’s tired of working, and can’t wait to relax. You can still send a message to the channel owner, though!

Channels are a simple, beautiful way to showcase and watch videos. Browse more Channels.