
Maeghan Lefebvre Manager • about 8 years ago
Datasets
The Boston Cleanweb Hackathon organizers have compiled a number of datasets that your team may find useful for your project. You can find them in our Dropbox folder: https://www.dropbox.com/l/b5pIJMjjtv6seFtY9pzTbu
2 comments
Maeghan Lefebvre Manager • about 8 years ago
EnerNOC has provided more data for the Cleanweb Hackathon. Check it out here!
https://drive.google.com/open?id=0Bwka9tAt_wtkblktd0hHNW93VHM&authuser=0
AngusShaw Manager • about 8 years ago
Explanation of EnerNOC Data
Luis Sano-Espinosa
April 9, 2015
Each CSV file has the following schema:
glimpse(intervalData)
## Observations: 105122
## Variables:
## $ siteid (fctr) 0023269678bd0206753caa3136812160, 0023269678bd0206...
## $ meterid (fctr) 9262a1cbb1b2dcb04a4b703c8f341029, 9262a1cbb1b2dcb0...
## $ dttm (fctr) 2014-01-01 00:00:00, 2014-01-01 00:05:00, 2014-01-...
## $ demand_kWh (dbl) 14.88, 14.88, 16.80, 14.88, 15.36, 15.36, 13.92, 14...
Each file contains 5-minute electricity interval data from 2014. A file includes every meter installed at one EnerNOC customer location, so it can hold anywhere from 1 to N meters. All timestamps in these files are in UTC.
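Because a site can have several meters, a site-level load curve needs the meter readings summed per timestamp. Here is a minimal sketch of that step, written in Python rather than the R used above. It assumes each CSV has a header row with the four column names from the schema; the sample rows and the shortened IDs (`site_a`, `meter_1`, `meter_2`) are made up for illustration.

```python
import csv
import io
from collections import defaultdict
from datetime import datetime, timezone

# Hypothetical sample in the documented schema; IDs and values are illustrative.
SAMPLE_CSV = """siteid,meterid,dttm,demand_kWh
site_a,meter_1,2014-01-01 00:00:00,14.88
site_a,meter_2,2014-01-01 00:00:00,3.12
site_a,meter_1,2014-01-01 00:05:00,14.88
site_a,meter_2,2014-01-01 00:05:00,2.12
"""


def site_totals(csv_file):
    """Sum demand_kWh over all meters for each (siteid, UTC timestamp)."""
    totals = defaultdict(float)
    for row in csv.DictReader(csv_file):
        # The dttm strings carry no offset, so mark them as UTC explicitly.
        ts = datetime.strptime(row["dttm"], "%Y-%m-%d %H:%M:%S")
        ts = ts.replace(tzinfo=timezone.utc)
        totals[(row["siteid"], ts)] += float(row["demand_kWh"])
    return dict(totals)


totals = site_totals(io.StringIO(SAMPLE_CSV))
for (site, ts), kwh in sorted(totals.items()):
    print(site, ts.isoformat(), round(kwh, 2))
```

To run this on a real file, pass an open file handle instead of the `io.StringIO` sample.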
We have also included a site attribute file that you can use to look up the following properties for a site:
glimpse(siteAttributes)
## Observations: 248
## Variables:
## $ siteid (fctr) a6037ce7e019fa6a1e73395d8c8299f9, 27495cc4fd15b05...
## $ lat (dbl) 28.18526, 40.76174, 40.77980, 40.01695, 39.27323, ...
## $ lng (dbl) -81.80134, -74.03539, -75.94105, -74.80028, -75.90...
## $ industry (fctr) Commercial Property, Light Industrial, Heavy Indu...
## $ subindustry (fctr) Big Box Retail, Manufacturing, Other Heavy Indust...
## $ timezone (fctr) America/New_York, America/New_York, America/New_Y...
The lat/long coordinates in this file have been randomized within a certain radius of the true location, so a site may occasionally show up in the ocean or a lake when you map it.
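Since the interval timestamps are in UTC and each site carries a `timezone` attribute, a common first step is to look up a site by `siteid` and convert readings to local time. The sketch below does this in Python (not the R used above) with the standard `zoneinfo` module. It assumes the attribute file has a header row with the column names shown; the row itself (`site_a` and its coordinates) is hypothetical.

```python
import csv
import io
from datetime import datetime, timezone
from zoneinfo import ZoneInfo

# Hypothetical attribute row in the documented schema; ID and coordinates
# are made up for illustration.
SITE_ATTRIBUTES_CSV = """siteid,lat,lng,industry,subindustry,timezone
site_a,40.7,-74.0,Commercial Property,Big Box Retail,America/New_York
"""


def load_site_attributes(csv_file):
    """Index the attribute rows by siteid for quick lookup."""
    return {row["siteid"]: row for row in csv.DictReader(csv_file)}


def to_site_local(dttm_utc, site):
    """Convert a UTC 'YYYY-MM-DD HH:MM:SS' string to the site's local time."""
    ts = datetime.strptime(dttm_utc, "%Y-%m-%d %H:%M:%S")
    ts = ts.replace(tzinfo=timezone.utc)
    return ts.astimezone(ZoneInfo(site["timezone"]))


sites = load_site_attributes(io.StringIO(SITE_ATTRIBUTES_CSV))
local = to_site_local("2014-01-01 00:00:00", sites["site_a"])
print(sites["site_a"]["industry"], local.isoformat())
# prints: Commercial Property 2013-12-31T19:00:00-05:00
```

Note that the first UTC reading of 2014 falls on the evening of December 31, 2013 for an America/New_York site, which matters when grouping by local day.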
There are a total of 248 sites in this dataset and the distribution across industries is as follows:
print(industries)
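A distribution like this can be recomputed from the site attribute file by counting rows per `industry`. The following is a rough Python sketch, assuming a header row with the column names shown above; the three sample rows are hypothetical, so the printed counts say nothing about the real dataset.

```python
import csv
import io
from collections import Counter

# Hypothetical attribute rows; IDs, coordinates, and counts are illustrative.
SITE_ATTRIBUTES_CSV = """siteid,lat,lng,industry,subindustry,timezone
site_a,40.7,-74.0,Commercial Property,Big Box Retail,America/New_York
site_b,40.8,-75.9,Light Industrial,Manufacturing,America/New_York
site_c,39.3,-75.9,Commercial Property,Big Box Retail,America/New_York
"""


def industry_counts(csv_file):
    """Count the number of sites in each industry."""
    return Counter(row["industry"] for row in csv.DictReader(csv_file))


counts = industry_counts(io.StringIO(SITE_ATTRIBUTES_CSV))
for industry, n in counts.most_common():
    print(f"{industry}: {n}")
```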
lsano@enernoc.com