This scraper captures the current cannabis product information from the Ontario Cannabis Store website.
To get the data scraped so far, visit https://morph.io/OddBloke/ontario_cannabis_store_scraper.
The database produced by the scraper (and available for download
here) has
two tables. data
contains the results from the most recent scrape,
for immediate access. history
contains all the result we've ever
scraped; these are in the same format as data
with a timestamp
column added. The timestamp will be the same for a particular scraping
run.
If you download the database from morph.io, then you can run the following queries using SQLite locally. Alternatively, morph.io do provide an API for running queries; you can find it here.
This will show you the products added between the most recent scrape and the one before it:
sql
SELECT url
FROM history
WHERE timestamp = (SELECT DISTINCT timestamp
FROM history
ORDER BY timestamp DESC
LIMIT 1)
AND url NOT IN (SELECT url
FROM history
WHERE timestamp = (SELECT DISTINCT timestamp
FROM history
ORDER BY timestamp DESC
LIMIT 1, 1));
This will show you the 5 highest average THC/$ dried flower products:
sql
SELECT name,
( thc_high + thc_low ) / 2 / price AS value,
url
FROM data
WHERE type = "Dried Flowers"
ORDER BY value DESC
LIMIT 5;
And we can do the same for CBD:
sql
SELECT name,
( cbd_high + cbd_low ) / 2 / price AS value,
url
FROM data
WHERE type = "Dried Flowers"
ORDER BY value DESC
LIMIT 5;
(Note that these queries won't work if you include other product types, because the weights for other products are variable.)
To download data sign in with GitHub
rows 10 / 156781
terpenes | sku | timestamp | plant_type | cbd_low | price | name | brand | cbd_high | thc_high | url | description | type | thc_low | image | 3.5g_price | 7g_availability | 1g_price | 1g_availability | 3.5g_availability | 7g_price | 15g_price | 15g_availability | standalone_availability | standalone_price | 0.5g_price | 0.5g_availability | 1.5g_availability | 2.5g_availability | 2.5g_price | 1.5g_price | 5g_availability | 5g_price | 1.25g_price | 1.25g_availability |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Myrcene,Alpha Pinene,Beta Caryophyllene,Beta-Pinene,Limonene
|
00694144000646
|
1541547120
|
Sativa dominant
|
0.0
|
9.55
|
Sunday Special
|
RIFF
|
1.0
|
23.2
|
RIFF’s Sunday Special is a musky sativa-dominant strain that the makers claim will produce feelings of relaxation, happiness and/or euphoria.
|
Dried Flowers
|
14.08
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Myrcene,Limonene,Humulene,Nerolidol
|
00882464000013
|
1541547120
|
Sativa dominant
|
0.0
|
10.25
|
Super Sonic
|
Symbl
|
1.0
|
17.0
|
Super Sonic from Symbl is a sativa-dominant hybrid with a strong, earthy, sweet aroma, reminiscent of Quantum Kush.
|
Dried Flowers
|
11.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Beta Caryophyllene,Myrcene,Trans Caryophyllene,Humulene,Limonene
|
00671148402034
|
1541547120
|
Hybrid
|
0.0
|
6.65
|
City Lights Pre-Roll
|
Edison
|
1.0
|
20.0
|
City Lights is an indica-dominant strain from Edison. It’s highly THC potent and said to help with relaxation, uplift and happiness. Available in one 0.5 g pre-roll.
|
Pre-Rolled
|
12.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Alpha Pinene,Beta Caryophyllene,Limonene,Myrcene,Trans Caryophyllene
|
00841464000775
|
1541547120
|
Indica dominant
|
0.0
|
9.5
|
Fantasy Island
|
SYNR.G
|
1.0
|
18.0
|
Fantasy Island is a SYNR.G indica-dominant strain with a mid to high THC potency.
|
Dried Flowers
|
12.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Beta Caryophyllene,Myrcene,Trans Caryophyllene,Alpha Pinene,Humulene
|
00671148402027
|
1541547120
|
Sativa dominant
|
0.0
|
6.65
|
Rio Bravo Pre-Roll
|
Edison
|
1.0
|
20.0
|
Rio Bravo is a sativa-dominant strain from Edison with high THC potency and the potential to boost energy, creativity and euphoria. Available in one 0.5 g pre-roll.
|
Pre-Rolled
|
12.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Beta Caryophyllene,Trans Caryophyllene,Humulene,Myrcene,Limonene
|
00628242240420
|
1541547120
|
Indica dominant
|
0.0
|
10.05
|
God Bud
|
Redecan
|
1.0
|
21.0
|
God Bud by Redecan is an indica-dominant hybrid with tropical flavour and mid-range THC potency.
|
Dried Flowers
|
13.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Myrcene,Beta Caryophyllene,Limonene,Beta-Pinene,Linalool
|
00694144000196
|
1541547120
|
Hybrid
|
8.0
|
9.55
|
Balance
|
Solei
|
13.0
|
8.0
|
Solei Balance’s indica-dominant dark green buds are sun-grown in a greenhouse and harvested for optimum spiciness.
|
Dried Flowers
|
4.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Beta-Pinene,Limonene,Nerolidol,Linalool
|
00882464000105
|
1541547120
|
Hybrid
|
0.0
|
10.25
|
Solar Power
|
Symbl
|
1.0
|
24.0
|
Similar to Sour Kush, Solar Power from Symbl is a tart hybrid with lemon and pine aromas, earthy and woody flavours, and hints of diesel.
|
Dried Flowers
|
15.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Nerolidol,Humulene,Limonene,Myrcene
|
00882464000150
|
1541547120
|
Indica dominant
|
0.0
|
10.25
|
Dreamweaver
|
Symbl
|
1.0
|
16.0
|
Dreamweaver from Symbl is comparable to MK Ultra. It’s an indoor indica-dominant hybrid with flavours of pine, spice and citrus.
|
Dried Flowers
|
10.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Myrcene,Beta-Pinene,Alpha Pinene,Guaiol
|
00628582000517
|
1541547120
|
Sativa dominant
|
0.0
|
10.4
|
Tangerine Dream
|
San Rafael
|
1.0
|
18.0
|
Sativa-dominant Tangerine Dream has a citrus aroma and deep purple buds, and a very high level of THC content.
|
Dried Flowers
|
11.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
To download data sign in with GitHub
rows 10 / 326110
timestamp | brand | name | size | availability | price |
---|---|---|---|---|---|
1542073184
|
Solei
|
Balance
|
7.0
|
1280
|
|
1542073184
|
Redecan
|
CBD Shark Shock
|
7.0
|
1246
|
|
1542073184
|
Redecan
|
B.E.C.
|
7.0
|
1339
|
|
1542073184
|
SYNR.G
|
Fantasy Island
|
7.0
|
0
|
|
1542073184
|
Redecan
|
White Shark
|
7.0
|
0
|
|
1542073184
|
Redecan
|
Cold Creek Kush
|
7.0
|
0
|
|
1542073184
|
Redecan
|
Shishkaberry
|
7.0
|
2356
|
|
1542073184
|
Redecan
|
Wappa
|
7.0
|
3508
|
|
1542073184
|
Symbl
|
Dreamweaver
|
7.0
|
1538
|
|
1542073184
|
Solei
|
Gather
|
7.0
|
0
|
|
Average successful run time: 4 minutes
Total run time: 12 months
Total cpu time used: 7 days
Total disk space used: 111 MB