DATA PRODUCT

USA POI & Foot Traffic Enriched Geospatial Dataset by Predik Data-Driven


Data product description

Our dataset provides detailed and precise insights into the business, commercial, and industrial aspects of any given area in the USA, including Point of Interest (POI) data and foot traffic. The dataset is divided into cells of approximately 150 m x 150 m (geohash 7) and has over 50 variables.

- Use it for different applications: Our combined dataset, which includes POI and foot traffic data, can be employed for various purposes. Different data teams use it to guide retailers and FMCG brands in site selection, fuel marketing intelligence, analyze trade areas, and assess company risk. Our dataset has also proven to be useful for real estate investment.

- Get reliable data: Our datasets have been processed, enriched, and tested so your data team can use them more quickly and accurately.

- Ideal for training ML models.

The high quality of our geographic information layers results from more than seven years of work dedicated to the deep understanding and modeling of geospatial Big Data. Among the features that distinguish this dataset is the use of anonymized and user-compliant mobile device GPS location data, enriched with other alternative and public data.

- Easy to use: Our dataset is user-friendly and can be easily integrated into your current models. We can also deliver your data in different formats, such as .csv, according to your analysis requirements.

- Get personalized guidance: In addition to providing reliable datasets, we advise your analysts on their correct implementation.

Our data scientists can guide your internal team on the optimal algorithms and models to get the most out of the information we provide (without compromising the security of your internal data).

Answer questions like: 
- What places does my target user visit in a particular area? Which are the best areas to place a new POS?
- What is the average yearly income of users in a particular area?
- What is the influx of visits that my competition receives?
- What is the volume of traffic surrounding my current POS?

This dataset is useful for getting insights from industries like:
- Retail & FMCG
- Banking, Finance, and Investment
- Car Dealerships
- Real Estate
- Convenience Stores
- Pharma and medical laboratories
- Restaurant chains and franchises
- Clothing chains and franchises

Our dataset includes more than 50 variables, such as:
- Number of pedestrians seen in the area.
- Number of vehicles seen in the area.
- Average speed of movement of the vehicles seen in the area.
- Points of Interest (POIs), by number and type, seen in the area (supermarkets, pharmacies, recreational locations, restaurants, offices, hotels, parking lots, wholesalers, financial services, pet services, and shopping malls, among others).
- Average yearly income range (anonymized and aggregated) of the devices seen in the area.

Notes to better understand this dataset:
- POI confidence means the average confidence of POIs in the area. In this case, POIs are any kind of location, such as a restaurant, a hotel, or a library.
 
- Category confidences: for example, "food_drinks_tobacco_retail_confidence" indicates how confident we are in the existence of food/drink/tobacco retail locations in the area.
 
- We added predictions for The Home Depot and Lowe's Home Improvement stores in the dataset sample. These predictions were the result of a machine-learning model that was trained with the data. Knowing where the current stores are, we can find the most similar areas for new stores to open.
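The idea of finding areas similar to existing store locations can be illustrated with a much simpler technique than the machine-learning model mentioned above. The sketch below is not that model: it swaps in a plain cosine similarity against the average feature vector of known store cells. The cell identifiers, the choice of features, and all numeric values are hypothetical and exist only for illustration.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def rank_candidate_cells(store_cells, candidate_cells):
    """Rank candidate cells by similarity to the centroid of known store cells."""
    n = len(store_cells)
    dims = len(next(iter(store_cells.values())))
    # Average feature vector of the cells that already contain a store.
    centroid = [sum(vec[i] for vec in store_cells.values()) / n for i in range(dims)]
    scored = [(cell, cosine_similarity(vec, centroid))
              for cell, vec in candidate_cells.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)

# Hypothetical feature vectors: [pedestrians, vehicles, average speed in km/h].
store_cells = {"store_1": [120, 300, 25], "store_2": [100, 280, 30]}
candidate_cells = {"cand_a": [110, 290, 27], "cand_b": [5, 900, 80]}

ranked = rank_candidate_cells(store_cells, candidate_cells)
print(ranked[0][0])  # the candidate most similar to existing store cells
```

In practice the features would likely need to be scaled to comparable ranges before comparison, since raw counts otherwise dominate the similarity score.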

How efficient is a Geohash?
Geohash is a faster, cost-effective geofencing option that reduces input data load and provides actionable information. Its benefits include faster querying, reduced cost, minimal configuration, and ease of use.

A geohash is a string of 1 to 12 characters, where each additional character denotes a smaller cell. The dataset can be split into variable-size geohashes, with the default being geohash7 (approximately 150 m x 150 m).
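To make the geohash concept concrete, here is a minimal sketch of the standard public geohash encoding, which interleaves longitude and latitude bits and maps every 5 bits to a base-32 character. The function name `geohash_encode` is ours and is not part of the dataset or any delivery tooling; it only shows how a latitude/longitude pair maps to a geohash7 cell.

```python
_BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"  # geohash alphabet (no a, i, l, o)

def geohash_encode(lat, lon, precision=7):
    """Encode a latitude/longitude pair as a geohash of the given length."""
    lat_lo, lat_hi = -90.0, 90.0
    lon_lo, lon_hi = -180.0, 180.0
    chars = []
    bits = 0
    value = 0
    use_lon = True  # bits alternate, starting with longitude
    while len(chars) < precision:
        if use_lon:
            mid = (lon_lo + lon_hi) / 2
            if lon >= mid:
                value = (value << 1) | 1
                lon_lo = mid
            else:
                value <<= 1
                lon_hi = mid
        else:
            mid = (lat_lo + lat_hi) / 2
            if lat >= mid:
                value = (value << 1) | 1
                lat_lo = mid
            else:
                value <<= 1
                lat_hi = mid
        use_lon = not use_lon
        bits += 1
        if bits == 5:  # every 5 bits become one base-32 character
            chars.append(_BASE32[value])
            bits = 0
            value = 0
    return "".join(chars)

cell = geohash_encode(57.64911, 10.40744, precision=7)
print(cell)
```

Because each character refines the previous cell, a shorter geohash is always a prefix of the longer one for the same point, which is what makes coarser or finer splits of the same data straightforward.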

Data size

5 TB (the full and complete nationwide dataset)

Data sources

Mobile apps

Data attributes

geohash7, pedestrians, vehicles, device_flow, Latitude, Longitude, kmh_geohash7_avg, kmh_geohash7_min, kmh_geohash7_max, timestamp_avg + 11 more ...
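As a rough sketch of how these attributes might be used, the example below rolls geohash7 rows up into coarser geohash6 cells by truncating the `geohash7` value and summing the `pedestrians` and `vehicles` counts. The sample rows and their values are invented for illustration, and treating these two attributes as additive counts is an assumption rather than something the dataset documentation confirms.

```python
from collections import defaultdict

def roll_up(rows, precision=6):
    """Sum pedestrian and vehicle counts per parent geohash of the given length."""
    totals = defaultdict(lambda: {"pedestrians": 0, "vehicles": 0})
    for row in rows:
        parent = row["geohash7"][:precision]  # a parent cell is a prefix of its children
        totals[parent]["pedestrians"] += row["pedestrians"]
        totals[parent]["vehicles"] += row["vehicles"]
    return dict(totals)

# Hypothetical sample rows; real deliveries contain many more attributes.
sample_rows = [
    {"geohash7": "9q8yyk8", "pedestrians": 40, "vehicles": 120},
    {"geohash7": "9q8yyk9", "pedestrians": 15, "vehicles": 60},
    {"geohash7": "9q8yym0", "pedestrians": 5, "vehicles": 10},
]

coarse = roll_up(sample_rows)
for cell, counts in coarse.items():
    print(cell, counts)
```

Averages such as `kmh_geohash7_avg` cannot be summed this way; they would need a weighted mean, for instance weighted by vehicle count.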

Data frequency

Capturing frequency: 1 per event
Transmission frequency: 1 per event

Geographical coverage

United States

Language

English

Temporal availability

  • Real-time data: 7 days
  • Historical: 1460 days