



[ASIMOV] module for data import powered by the [Bright Data] web data platform.
- Imports structured data from Airbnb, Amazon, Crunchbase, eBay, Facebook,
Google, Indeed, Instagram, LinkedIn, Walmart, X (aka Twitter), Yahoo, and
YouTube.
- Collects the raw JSON data via the Bright Data API (requires an API key).
- Constructs a semantic knowledge graph based on the [KNOW] ontology.
- Supports plain JSON output as well as [RDF] output in the form of JSON-LD.
- Distributed as a standalone static binary with zero runtime dependencies.
- Requires [Rust] 1.85+ (2024 edition) only when building from source code.
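
Install from PyPI ([Python]):

```bash
pip install -U asimov-brightdata-module
```

Install from RubyGems ([Ruby]):

```bash
gem install asimov-brightdata-module
```

Install from [NPM]:

```bash
npm install -g asimov-brightdata-module
```

Install with Cargo ([Rust]):

```bash
cargo install asimov-brightdata-module
```

Set the Bright Data API key:

```bash
export BRIGHTDATA_API_KEY="..."
```

Fetch an X (Twitter) profile as JSON, or import it as JSON-LD:

```bash
asimov-brightdata-fetcher https://x.com/bright_init    # JSON
asimov-brightdata-importer https://x.com/bright_init   # JSON-LD
```

Fetch a LinkedIn profile or company page:

```bash
asimov-brightdata-fetcher https://www.linkedin.com/in/orlenchner/
asimov-brightdata-fetcher https://www.linkedin.com/company/bright-data/
```

Fetch a Crunchbase organization:

```bash
asimov-brightdata-fetcher https://www.crunchbase.com/organization/brightdata
```

Fetch an Amazon product:

```bash
asimov-brightdata-fetcher https://www.amazon.com/Master-Algorithm-Ultimate-Learning-Machine/dp/0465094279
```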
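
To fetch several URLs in one go, the fetcher can be wrapped in a small shell loop. The following is only a sketch: the `fetch_all` function, the `FETCHER` variable, and the numbered output file names are hypothetical and not part of the module. It relies solely on the `asimov-brightdata-fetcher <URL>` invocation shown above and assumes the fetcher prints its JSON to standard output.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the module): fetch each URL and save
# the result to a numbered file in the given output directory.
#
# Usage: fetch_all OUTDIR URL...

# FETCHER may be overridden, e.g. for testing; it defaults to the documented binary.
FETCHER="${FETCHER:-asimov-brightdata-fetcher}"

fetch_all() {
  local outdir="$1"; shift
  local i=0 url
  mkdir -p "$outdir" || return 1
  for url in "$@"; do
    i=$((i + 1))
    # Assumption: the fetcher writes JSON to standard output.
    if ! "$FETCHER" "$url" > "$outdir/$i.json"; then
      echo "failed to fetch: $url" >&2
    fi
  done
}
```

A failed fetch is reported on standard error and the loop carries on with the next URL, so one bad URL does not abort the whole batch.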

Environment variables:

- `BRIGHTDATA_API_KEY`: (required) the [Bright Data API key] to use

Installed binaries:

- `asimov-brightdata-cataloger`: discovers entities via the Bright Data API
  _(not implemented yet)_
- `asimov-brightdata-fetcher`: collects JSON data from the Bright Data API
- `asimov-brightdata-importer`: collects and transforms JSON into JSON-LD
  _(not implemented yet)_
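
Because the API key is required, a script that calls these binaries may want to fail early when the environment is incomplete. The check below is a minimal sketch: the `preflight` function is a hypothetical name, and nothing here reflects how the module itself validates its configuration.

```bash
#!/usr/bin/env bash
# Hypothetical preflight check (not part of the module).
# Verifies that BRIGHTDATA_API_KEY is set and that the given binary
# (default: asimov-brightdata-fetcher) can be found on PATH.
preflight() {
  local bin="${1:-asimov-brightdata-fetcher}"
  if [ -z "${BRIGHTDATA_API_KEY:-}" ]; then
    echo "BRIGHTDATA_API_KEY is not set" >&2
    return 1
  fi
  if ! command -v "$bin" >/dev/null 2>&1; then
    echo "$bin not found on PATH" >&2
    return 1
  fi
}
```

A script could then run `preflight || exit 1` before its first fetch.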

| Dataset | URL Prefix | JSON | RDF |
| :------ | :--------- | :--: | :--: |
| Airbnb | `https://www.airbnb.com/rooms/` | ✅ | 🚧 |
| Amazon | `https://www.amazon.com/` | ✅ | 🚧 |
| | `https://www.amazon.com/sp?seller=` | ✅ | 🚧 |
| Crunchbase | `https://www.crunchbase.com/organization/` | ✅ | 🚧 |
| eBay | `https://www.ebay.com/itm/` | ✅ | 🚧 |
| Facebook | `https://www.facebook.com/events/` | ✅ | 🚧 |
| | `https://www.facebook.com/groups/` | ✅ | 🚧 |
| | `https://www.facebook.com/marketplace/item/` | ✅ | 🚧 |
| | `https://www.facebook.com/share/p/` | ✅ | 🚧 |
| Google | `https://www.google.com/shopping/product/` | ✅ | 🚧 |
| Indeed | `https://www.indeed.com/cmp/` | ✅ | 🚧 |
| Instagram | `https://www.instagram.com/` | ✅ | 🚧 |
| | `https://www.instagram.com/p/` | ✅ | 🚧 |
| | `https://www.instagram.com/reel/` | ✅ | 🚧 |
| LinkedIn | `https://www.linkedin.com/company/` | ✅ | 🚧 |
| | `https://www.linkedin.com/in/` | ✅ | 🚧 |
| | `https://www.linkedin.com/jobs/` | ✅ | 🚧 |
| | `https://www.linkedin.com/posts/` | ✅ | 🚧 |
| | `https://www.linkedin.com/pulse/` | ✅ | 🚧 |
| Walmart | `https://www.walmart.com/global/seller/` | ✅ | 🚧 |
| | `https://www.walmart.com/ip/` | ✅ | 🚧 |
| X (Twitter) | `https://x.com/` | ✅ | ✅ |
| Yahoo | `https://finance.yahoo.com/quote/` | ✅ | 🚧 |
| YouTube | `https://www.youtube.com/@` | ✅ | 🚧 |
| | `https://www.youtube.com/watch?v=` | ✅ | 🚧 |
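
The URL prefixes in the table can be used to check, before calling the fetcher, whether a URL belongs to a supported dataset. The function below is an illustrative sketch only: `dataset_for_url` is a hypothetical helper, and this is not how the module itself resolves URLs. Broader prefixes such as `https://www.amazon.com/` and `https://www.instagram.com/` already cover the more specific ones listed under the same dataset.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the module): print the dataset name
# for a URL based on the documented URL prefixes, or fail if none matches.
dataset_for_url() {
  case "$1" in
    https://www.airbnb.com/rooms/*)             echo "Airbnb" ;;
    https://www.amazon.com/*)                   echo "Amazon" ;;
    https://www.crunchbase.com/organization/*)  echo "Crunchbase" ;;
    https://www.ebay.com/itm/*)                 echo "eBay" ;;
    https://www.facebook.com/events/* | \
    https://www.facebook.com/groups/* | \
    https://www.facebook.com/marketplace/item/* | \
    https://www.facebook.com/share/p/*)         echo "Facebook" ;;
    https://www.google.com/shopping/product/*)  echo "Google" ;;
    https://www.indeed.com/cmp/*)               echo "Indeed" ;;
    https://www.instagram.com/*)                echo "Instagram" ;;
    https://www.linkedin.com/company/* | \
    https://www.linkedin.com/in/* | \
    https://www.linkedin.com/jobs/* | \
    https://www.linkedin.com/posts/* | \
    https://www.linkedin.com/pulse/*)           echo "LinkedIn" ;;
    https://www.walmart.com/global/seller/* | \
    https://www.walmart.com/ip/*)               echo "Walmart" ;;
    https://x.com/*)                            echo "X (Twitter)" ;;
    https://finance.yahoo.com/quote/*)          echo "Yahoo" ;;
    "https://www.youtube.com/@"* | \
    "https://www.youtube.com/watch?v="*)        echo "YouTube" ;;
    *)
      echo "unsupported URL: $1" >&2
      return 1
      ;;
  esac
}
```

For example, `dataset_for_url "https://x.com/bright_init"` prints `X (Twitter)`, while a URL outside the table returns a non-zero status.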

Clone the source code repository:

```bash
git clone https://github.com/asimov-modules/asimov-brightdata-module.git
```





[ASIMOV]: https://github.com/asimov-platform
[Bright Data]: https://brightdata.com/products/web-scraper
[Bright Data API key]: https://docs.brightdata.com/general/account/api-token
[JSON-LD]: https://json-ld.org
[KNOW]: https://github.com/know-ontology
[NPM]: https://npmjs.org
[Python]: https://python.org
[RDF]: https://github.com/rust-rdf
[Ruby]: https://ruby-lang.org
[Rust]: https://rust-lang.org