Back to DrivenData | Blog

business_id.txt file is missing in the new data

I downloaded the new data from the drivensata download page. But in the new folder, business_id.txt file is missing from the new data download. It was there in the old data.
Can anybody take a look into it ?

Hi @scigeek,

The business_id.txt file is just a list of the unique IDs from the Yelp dataset. We’ve put a new copy that matches the latest data release up on the data download page in case it’s useful!


@scigeek: couldn’t resist the opportunity to plug jq :smiley_cat:

Here’s the one-liner:

jq .business_id yelp_academic_dataset_business.json

Which results in:


If you don’t want the quotes you can pass jq the option -r for raw output.

Note: there may be a couple of IDs here that aren’t in restaurant_ids_to_yelp_ids.csv - that’s the authoritative mapping so I’d use that for any matching.

1 Like