Planet Open Data

Saturday, 23. December 2017

Open Data Questions

Historical Forward Exchange rates for QAR (Saudi Riyal)

I need data of 3-month Forward exchange rates between 5 pairs of currencies. Saudi Arabia Riyal (QAR or SAR) on one hand; and 5 currencies, USD, GBP, CHF, EUR, CNY on the other hand. I need quarterly data for period 2000-2015. I can find spot rates, but desperate to find forward rates. I can pay small amount of money, if anybody can help. Need urgently

I need data of 3-month Forward exchange rates between 5 pairs of currencies. Saudi Arabia Riyal (QAR or SAR) on one hand; and 5 currencies, USD, GBP, CHF, EUR, CNY on the other hand. I need quarterly data for period 2000-2015. I can find spot rates, but desperate to find forward rates. I can pay small amount of money, if anybody can help. Need urgently


Pagination Type for Healthcare Finder API

Does anyone know the type of pagination that should be used when making a getCountiesForZip request from the HealthCare finder API?

I'm trying to use it with Qlik Sense which offers the following types found at help.qlik.com/en-US/connectors/Subsystems/REST_connector_help/Content/1.0/Create-REST-connection/Pagination-scenarios.htm:

Offset uses a starting value from which to read

Does anyone know the type of pagination that should be used when making a getCountiesForZip request from the HealthCare finder API?

I'm trying to use it with Qlik Sense which offers the following types found at http://help.qlik.com/en-US/connectors/Subsystems/REST_connector_help/Content/1.0/Create-REST-connection/Pagination-scenarios.htm:

Offset uses a starting value from which to read additional records.

Next token uses a token that is passed to the URL call for the next set of records.

Next URL uses a value that contains the URL for the next set of records.

Custom is a special option that can be used when none of the other paging options are implemented.


wikidata extract

I have a list of wikidata entity IDs (i.e. Tour Eiffel Q243, Big Ben Q41225...), and a list of properties (i.e. coordinates P625, country P18 ...).

Is there any way to gather and extract in Excel (or csv) a table with the information?

What I look for is something like:

[♦

I have a list of wikidata entity IDs (i.e. Tour Eiffel Q243, Big Ben Q41225...), and a list of properties (i.e. coordinates P625, country P18 ...).

Is there any way to gather and extract in Excel (or csv) a table with the information?

What I look for is something like:

[table[1]


Is there any open dataset for named entity linking?

Entity linking is the task of detecting mentions of entities from a knowledge base in a document.

Are there any openly available datasets to train and evaluate such systems? The datasets I came across are all closed (for instance, TAC-KBP is owned by the Linguistic Data Consortium, the YAGO-AIDA dataset requires access to the original CoNLL 2003 dataset, and so on).

The dataset s

Entity linking is the task of detecting mentions of entities from a knowledge base in a document.

Are there any openly available datasets to train and evaluate such systems? The datasets I came across are all closed (for instance, TAC-KBP is owned by the Linguistic Data Consortium, the YAGO-AIDA dataset requires access to the original CoNLL 2003 dataset, and so on).

The dataset should contain some text with reference annotations (for instance links to Wikipedia), and should be of reasonable size for machine learning purposes. For instance, Wikipedia dumps themselves provide this to some extent, but the wikilinks are generally not intended to be complete, so the annotations are very sparse.

Monday, 23. October 2017

Open Data Questions

Where to find fracture xray imaging data?

Where can I retrieve X-ray image data of fractures?

Labeled data containing features like age, type of fracture would be best.

Where can I retrieve X-ray image data of fractures?

Labeled data containing features like age, type of fracture would be best.


MIMIC-III: many patients without prescription information?

Of the 46,520 patients listed in the PATIENTS database, 7,157 do not appear in the PRESCRIPTIONS database. About half of these are newborns (age < 1 year), but there remains about 3600 patients w/ mean age 73 (approx normally distributed) who have no medication info.

Why is there no medication information on 8% of the patients in the database? Is this group random or are they special

Of the 46,520 patients listed in the PATIENTS database, 7,157 do not appear in the PRESCRIPTIONS database. About half of these are newborns (age < 1 year), but there remains about 3600 patients w/ mean age 73 (approx normally distributed) who have no medication info.

Why is there no medication information on 8% of the patients in the database? Is this group random or are they special in some way?


Need data to access localized weather data

I am trying to build an application similar to zyGrib where in I want to add wind data onto an open layers map for a small local area. I am aware that there are public sources that I can access to get this data but I haven't been successful in finding one though. I want to preferably be able to access a very localized data set instead of downloading a huge global data set.

I am trying to build an application similar to zyGrib where in I want to add wind data onto an open layers map for a small local area. I am aware that there are public sources that I can access to get this data but I haven't been successful in finding one though. I want to preferably be able to access a very localized data set instead of downloading a huge global data set.


open access database with companies/large employers registered in a given city in US

Is anyone aware of an open access database with companies/large employers registered in a given city in US, specifying registration date (office opening).

Is anyone aware of an open access database with companies/large employers registered in a given city in US, specifying registration date (office opening).

Thursday, 21. December 2017

Open Data Questions

Obesity/Inactivity Data at a census tract level

Looking for obesity/inactivity data for adults at a census tract level for the state of California. Any good sources? Preferably in a format that can be uploaded to arcdesktop.

Looking for obesity/inactivity data for adults at a census tract level for the state of California. Any good sources? Preferably in a format that can be uploaded to arcdesktop.


Dataset for IAB taxonomy text classification?

I'd like to train a system that takes text and predicts IAB classes (www.iab.com/guidelines/iab-quality-assurance-guidelines-qag-taxonomy/).

Are there any public datasets available for this?

I'd like to train a system that takes text and predicts IAB classes (https://www.iab.com/guidelines/iab-quality-assurance-guidelines-qag-taxonomy/).

Are there any public datasets available for this?


Looking for labeled audio data for sentiment

I'm looking for labelled audio data. Like a cat meowing, or a spoon falling on the ground, a car driving past, etc... i.e. sound clips of events. Does anyone know of where to find this? Would be very grateful for any help here.

I'm looking for labelled audio data. Like a cat meowing, or a spoon falling on the ground, a car driving past, etc... i.e. sound clips of events. Does anyone know of where to find this? Would be very grateful for any help here.


historical price data for Sdax Index

I am looking for historical price data in .csv or .txt format to download for the following financial instrument:

Sdax Performance Index (It's a german smallcap indice in case you care)

ISIN: DE0009653386
RIC: ^SDAXI

I need the data starting from 30.12.1987 as far as possible.

I am looking for historical price data in .csv or .txt format to download for the following financial instrument:

Sdax Performance Index (It's a german smallcap indice in case you care)

ISIN: DE0009653386
RIC: ^SDAXI

I need the data starting from 30.12.1987 as far as possible.


Auto dealers database

Where can I find an auto dealer database?

Format database: Dealer name, address, brand, phone, website, and more other information. If possible, the worldwide. If it is impossible for the world, I will be glad and individual countries (interested in any of the countries).

I can parse results from website or database XML, but I don't know where to find it.

Where can I find an auto dealer database?

Format database: Dealer name, address, brand, phone, website, and more other information. If possible, the worldwide. If it is impossible for the world, I will be glad and individual countries (interested in any of the countries).

I can parse results from website or database XML, but I don't know where to find it.


Open Data Portal/Software for Live Measurements: is there any?

I am working in meteorology/glaciology. We have quite a lot of observations (partly live!) and we would like to publish these data (live!) to the world.

So what we are looking for is a portal/software that allows publishing such "time series" data. This would include data such as temperature measurements, precipitation measurements, maybe glacier length changes, and so far and so on.

I am working in meteorology/glaciology. We have quite a lot of observations (partly live!) and we would like to publish these data (live!) to the world.

So what we are looking for is a portal/software that allows publishing such "time series" data. This would include data such as temperature measurements, precipitation measurements, maybe glacier length changes, and so far and so on.

There is a chance to get some money of the national science fund - smaller projects to make data from past projects available to all of you. As I could not find anything suitable on the web:

  • Is there something like this but I was too stupid to find it?
  • If not: do you think that there is a need for it?

The idea - if not yet available - would be an open source software including a flexible backend, data upload interface (e.g., xml data xchange via scp/ftp/web upload), and a frontend offering "simple" data series plots, data exports, and that the uploader/maintainer of these data sets can write/upload notes, manuals, important information (e.g., instrument correction coefficients, when instruments have been maintained/replaced, ...).

Thank you very much for the input! We are currently in a "discussion" or "rough project planning" phase and all comments or hints will be more than helpful!

Monday, 18. December 2017

Open Data Questions

what is the difference between drugs@fda and openfda?

What is the difference between (drugs@fda or drugs@fda data files) and openfda? Is there a recommended resource to use for drug name lookup?

What is the difference between (drugs@fda or drugs@fda data files) and openfda? Is there a recommended resource to use for drug name lookup?


Dataset for emotion classification into happy, sad, angry

I am looking for a dataset for Mood or emotion (Happy, Angry, Sad) classification.That is to classify a text is it a happy, angry or sad related sentential text. I have used naive Bayes classification for this analysis. Now just to train and test the model with the dataset, we require a strong one. We are not getting a good efficiency with the current datasets that we are using, can you please

I am looking for a dataset for Mood or emotion (Happy, Angry, Sad) classification.That is to classify a text is it a happy, angry or sad related sentential text. I have used naive Bayes classification for this analysis. Now just to train and test the model with the dataset, we require a strong one. We are not getting a good efficiency with the current datasets that we are using, can you please suggest a strong one?


Data on user-user trust ratings and user-item ratings

I am doing a research project on Recommender System, where I need data of user-item ratings and user-user trust ratings. Publicly available similar datasets are

  • Film trust dataset
  • Epinion dataset

Both have user-item ratings but don't have user-user trust ratings; instead they have user-user trust statement. Also they only contain positive trust statement a

I am doing a research project on Recommender System, where I need data of user-item ratings and user-user trust ratings. Publicly available similar datasets are

Both have user-item ratings but don't have user-user trust ratings; instead they have user-user trust statement. Also they only contain positive trust statement and don't have negative trust statement.

The difference between statement and ratings is, statement can only have value of 1(trusted) or -1(not trusted), but rating will have values in range 0-5 or 0-1.

So is there any prior work on this type dataset and where can I find these type dataset?


Historic Road Data

I would like to know if there is any data repository that can provide historic road data, starting from 1995 to 2017? The location of the data can be arbitrary, except United Kingdom. Also, I am looking for as much possible hierarchically available roads, with main focus on major road, secondary, connector etc. I have also looked on planet.openstreetmap.org/planet/full-history/, but the data tha

I would like to know if there is any data repository that can provide historic road data, starting from 1995 to 2017? The location of the data can be arbitrary, except United Kingdom. Also, I am looking for as much possible hierarchically available roads, with main focus on major road, secondary, connector etc. I have also looked on https://planet.openstreetmap.org/planet/full-history/, but the data that are provided do not fit on my requirements.

I want to compare different street networks in different time intervals, thus the starting date must be between 1995-2000 in order to capture the slow rate of a road network's evolution.


Public International Database of Literature Works?

There's a question about books: (Public database of book titles?), but what I'm looking for is actually a DB of literature works, not books.

For example, "Romeo and Juliet" is a single literature work, but there are multiple books published with it.

So this is definitely different.

There's a question about books: (Public database of book titles?), but what I'm looking for is actually a DB of literature works, not books.

For example, "Romeo and Juliet" is a single literature work, but there are multiple books published with it.

So this is definitely different.


Federal Reserve - Tremendous amount of data

In the world of stock and hedge funding, does there exist, for instance, a way to gather a tremendous amount of data on the price of wheat from 20 years on the stock exchange (For instance, on Wall Street stock)? In fact, I'd like to treat, statistically, the price of a stock product, but I don't know where we may find out that type of data.

In the James Simons's discussions (A rare in

In the world of stock and hedge funding, does there exist, for instance, a way to gather a tremendous amount of data on the price of wheat from 20 years on the stock exchange (For instance, on Wall Street stock)? In fact, I'd like to treat, statistically, the price of a stock product, but I don't know where we may find out that type of data.

In the James Simons's discussions (A rare interview with the mathematician who cracked Wall Street), Simons explains that "The real thing was to gather a tremendous amount of data -- and we had to get it by hand in the early days. We went down to the Federal Reserve and copied interest rate histories and stuff like that, because it didn't exist on computers." (Time : 11:26), but where is that Federal Reserve?


Where to find household financial data for my research

I am a programming student, I want to make a financial application for my project. The application works by gathering user's income and expenses to create a prediction about their financial status next month.

My project is to create research about the application I want to make. The problem is, I need some real data to calculate the accuracy of my application. The data I need are income

I am a programming student, I want to make a financial application for my project. The application works by gathering user's income and expenses to create a prediction about their financial status next month.

My project is to create research about the application I want to make. The problem is, I need some real data to calculate the accuracy of my application. The data I need are income and expenses of some people for at least 5 months. Does anybody know where I can possibly find this kind of data, or how can I make a survey to get this data?

In addition, I can't wait for 5 months to gather the data.


Where can I get consistent global map data for topology and environmental properties?

I'm a developer trying to procedurally generate a Minecraft map based on the real Earth. I've recently spent a few days looking for Earth data for my generator to use, but I've found very few good results.

Here's a list of data sets I could use for generating such a map:

  • Elevation
  • Bathymetric (undersea depth)
  • Vegetation(coverage percentage, type of vegetat

I'm a developer trying to procedurally generate a Minecraft map based on the real Earth. I've recently spent a few days looking for Earth data for my generator to use, but I've found very few good results.

Here's a list of data sets I could use for generating such a map:

  • Elevation
  • Bathymetric (undersea depth)
  • Vegetation(coverage percentage, type of vegetation, etc.)
  • Soil type
  • Crust composition data
  • Water salinity

I'm extremely new to global data collection, so there's probably a host of other properties which could improve the map. However, I've not been able to find a consistent source for getting such data. I've found a few sources, but each of them has problems.

For example, this land-cover dataset has decent vegetation and soil maps, but their resolution is 1 degree² per pixel and there are a large amount of holes. Ideally, I'm looking for something at least with a resolution of 0.01² degrees (36 arcseconds). For this project, I'm looking for maps in/convertible to the Miller or equirectangular projection.

If there are absolutely no other options, I can just use a vector tracing algorithm to roughly upscale the vegetation maps and then fill in the holes, but that should be a last resort.

Thursday, 19. October 2017

Open Data Questions

Difference between Inspection Observation and Inspection Citation reports

What is the difference between the FDA Inspection Observation and Inspection Citation reports? The inspection citation report appears to have slightly less data but provides more details for what is included.

What is the difference between the FDA Inspection Observation and Inspection Citation reports? The inspection citation report appears to have slightly less data but provides more details for what is included.


Sidewalk accessibility mapping in California

I'm looking for a data-driven way to virtually assess accessibility of sidewalks in California, particularly in the Bay Area. Basically, any tools that would help with estimating a 'walkscore' type of rating for sidewalks would be ideal.

FYI - There is a group of researchers at UMD doing something similar for sidewalks in Washington DC. Here's the link: sidewalk.umiacs.umd.edu

If

I'm looking for a data-driven way to virtually assess accessibility of sidewalks in California, particularly in the Bay Area. Basically, any tools that would help with estimating a 'walkscore' type of rating for sidewalks would be ideal.

FYI - There is a group of researchers at UMD doing something similar for sidewalks in Washington DC. Here's the link: http://sidewalk.umiacs.umd.edu

If you know any tools/ resources/ contacts who would be useful or can point me in the right direction, please let me know.

Thanks in advance!

pluto.models/1.4.0, feed.parser/1.0.0, feed.filter/1.1.1 - Ruby/2.0.0 (2014-11-13/x86_64-linux) on Rails/4.2.0 (production)