What is DataGraft?

DataGraft is a cloud-based service for data transformation and data access.

Who are DataGraft's target users?

DataGraft is aimed at data workers and data developers interested in simplified and cost-effective solutions for managing their data, with a focus on data transformation, data hosting, and data access. Example users include open data publishers, linked data developers, data scientists.

Why DataGraft?

DataGraft was developed to provide better and easier to use tools for data workers and developers who consider existing approaches to data transformation, hosting, and access too costly and technically complex.

Why is DataGraft special?

DataGraft offers an integrated, flexible, and reliable cloud-based solution for data transformation and data access. Key features include flexible management of data transformations (e.g. interactive creation, execution, sharing, reuse) and reliable data services. DataGraft offers GUI and API access to data transformations and data services.

Are all my data and/or transformations on DataGraft public?

When creating transformations or publishing data you are hosting them on DataGraft. You can choose whether you want to make them public (i.e. visible on the DataGraft portal) or private (i.e. only you have access to them).

What can I do with DataGraft?

The current version of DataGraft allows you to:

  • Transform tabular data and share transformations: edit, host, execute, and share data transformations
  • Publish, share, and access data: Scalable data hosting and reliable data access / data querying
  • What data formats does DataGraft support?

    The current version of DataGraft handles tabular data (CSV) and RDF data (i.e. you can upload CSV and RDF data to DataGraft). Data transformations (e.g. data cleaning) can be performed on tabular data, which in turn can be transformed to RDF data. For RDF data, live data services are created that allow programmatic querying. More data formats are expected to be supported in the future.

    What querying languages does DataGraft support?

    The current version of DataGraft allows querying the hosted RDF data via SPARQL.

    What skills do I need to have to use DataGraft?

  • If you are interested in tabular data transformations (e.g. cleaning of tabular data, sharing of transformations, etc.), no special skills are required, depending on the complexity of the transformations you want to perform. Currently DataGraft supports a number of predefined data transformations, with more to be added in the near future.
  • If you are interested in publication of RDF data and access to RDF data, familiarity with RDF and SPARQL are needed.
  • Is there any documentation for DataGraft?

    Yes, draft documentation is available here. Documentation will be further improved in the upcoming period.

    For background materials on technical aspects related to DataGraft you can check out the proDataMarket technical reports. Useful presentations can be found here and here.

    I am a developer and interested to transform and host my transformations and data with DataGraft. Do I have API access to DataGraft capabilities?

    Yes, DataGraft capabilities are accessible at the API level. These include programmatic access to datasets and data transformation catalogues (adding, removing datasets and transformations), querying data, and executing data transformations. The capabilities offered through the DataGrat.net portal are a subset of capabilities offered at the API level. If you want to benefit from all the features of DataGraft, API level access is recommended. See the API documentation for further details.

    Is it free to use DataGraft?

    For now, yes. With the following limitations per account:

  • Data upload: You can upload CSV files of up to 10MB each, and RDF files of up to 100MB each;
  • Datapages: You can have up to 10 RDF datapages;
  • Persistent storage: You can host up to 2 GB of CSV data, and 1 Million RDF triples for RDF data.
  • If you find DataGraft useful and you are interested to use it beyond its current limitations please get in touch with us. As the development progresses these limitations may be relaxed for all accounts.

    What is a datapage?

    A datapage is a way to organize data. You can think of a datapage as a Web page with information about a dataset (e.g. metadata, link to download data). Currently we support two types of datapages: CSV datapages and RDF datapages. For technical reasons, at this stage, a CSV datapage can accomodate one input CSV file, however an RDF datapage may accomodate more than one input RDF file (within the limintations mentioned in the previous question).

    What is the status of the DataGraft development?

    DataGraft is currently under iterative development. The current version of DataGraftrepresents a second alpha release.

    Where can I report issues/bugs?

    Since DataGraft is currently in beta, it contains bugs. There are issues we know of, and likely others that we don’t yet know of. You can report issues here.

    What features are planned for future releases?

    Future releases of DataGraft will focus on improving or extending existing features. An unordered list of future feature enhancements includes:

  • Read-only transformations
  • Transformation history with undo/redo options
  • Support for various data formats: e.g., JSON, geospatial (GML, Shapefile)
  • Support for multiple files / join datasets
  • Push/pull services for updating outputs in different frequency
  • Better error reporting in transformations
  • Improving functions on tables (vs in pipeline)
  • Dynamic deployment of transformations (cloud-based)
  • (Public/Private) sharing of utility function
  • JavaScript transformation code
  • Predictive transformations, i.e. learn from previous transformations
  • Dealing with streams of data vs static files
  • Better traceability: files, data pages, transformations
  • Geospatial processing of data: radius around point, radius around polygon, data within polygon, squares, areas
  • Interested in any of the above issues? Get in touch with us!

    I need feature XYZ but it is not supported by DataGraft. What can I do?

    Contact us and we’ll be glad to assist!

    Whom can I contact if I have further comments and questions about DataGraft?

    Please contact us by email and we'll answer your questions as soon as possible!