Skip to content

Commit 2cba48c

Browse files
lkvehlinglkvehling
andauthored
Add Lillie pipeline markdown (#4)
Co-authored-by: lkvehling <lkvehling@ucdavis.edu>
1 parent 0e975e9 commit 2cba48c

1 file changed

Lines changed: 27 additions & 0 deletions

File tree

TODO_PIPELINES/lillie_pipeline.md

Lines changed: 27 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,27 @@
1+
## APIs/Data Resources:
2+
1. https://collegefootballdata.com/
3+
- advertises free data on NCAA football
4+
- not sure if it is just FBS or FCS also
5+
- not sure on its speed for updates or accuracy but seems like a starting place
6+
2. https://www.basketball-reference.com/
7+
- another supposedly free site on NBA, ABA, G League, and WNBA
8+
- team, player, league information
9+
- not sure how the data is downloaded
10+
3. https://www.sports-reference.com/
11+
- more general sports from basketball-reference
12+
- includes more sports include baseball, football pro and college, basketball pro and college
13+
- not sure on update time but has good historical data
14+
- not betting specific
15+
4. https://www.kaggle.com/datasets/ehallmar/nba-historical-stats-and-betting-data
16+
- money lines betting information for NBA games
17+
5. https://www.kaggle.com/datasets/scottfree/sports-lines
18+
- betting information for line, over/under, and game results for select seasons of select sports
19+
- offer variety and also an AlphaPy python model to analyze the trend data in the game results
20+
21+
## High Level WorkFlow
22+
1. Collect data from APIs or data resources
23+
2. Filter and Clean Data into desired values and parameters
24+
3. Split the data into train and test
25+
4. Fit a linear regression model
26+
5. Evaluate accuracy --> RSME, MAE, R^2
27+
6. Adjust and improve

0 commit comments

Comments
 (0)