Problem Design Statement #11
Open
yk2684 wants to merge 15 commits into feature-engineering-studio:master from yk2684:master
Commits
b6ce1a1 Problem Design Statement (yk2684)
667f33f Fake Data (yk2684)
ea01648 Create In progress (yk2684)
06e9afe Delete In progress (yk2684)
8bf1280 Create In progress (yk2684)
cada875 Create Notes (yk2684)
4dbe9a4 Update In progress (yk2684)
2165d81 Update Notes (yk2684)
be4ed6c Update Notes (yk2684)
3df6165 Update Notes (yk2684)
1a785d2 Update In progress (yk2684)
31939e5 Update Notes (yk2684)
ef228d4 Update In progress (yk2684)
00acfc6 Update Notes (yk2684)
3749c34 Update In progress (yk2684)
Binary file not shown.
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,28 @@ | ||
| I genuinely believe that music streaming is the future of the industry, and it has made music more accessible to people. However, there are many critics who believe that music streaming is hurting the industry, and that music streaming services do not pay artists enough money. | ||
|
|
||
| I want to better understand the impact of music streaming services on the music industry and artists based on revenues generated within the industry for artists. I also wish to identify which category of artists (based on popularity) benefit from or are hurt by these music streaming services. In completing this goal, I hope to clarify the misconceptions surrounding music streaming services, and if necessary, inform better royalty pool structures for current and future music streaming services in order to better serve artists. In achieving this, I plan to collect data on revenues generated in the US music industry by format (CD sales, concert ticket sales, vinyl sales, etc.) over 30+ years. Then, I will categorize artists based on their popularity by looking at social media likes/follows, and will collect data on their estimated revenues based on the existing royalty pool structures established by streaming services. Because my main goal is to demystify the impact music streaming services have on the music industry, my priority will lie in collecting data on revenues earned in the music industry. | ||
|
|
||
| EDIT | ||
| Goal: I want to better understand the impact of music streaming services on the music industry and artists based on revenues generated within the industry for artists. I also wish to identify which category of artists (based on popularity) benefit from or are hurt by these music streaming services. | ||
|
|
||
| Input: Secondary research on what industry people and artists are saying about streaming services. If there are complaints, why are they complaining? Collect data on the number of artists using streaming services, and see which kinds of artists use them more. Acknowledge that music streaming services are good. | ||
| Activities: Production and sale of music, revenue generated from the sale | ||
| Output: I plan to collect data on revenues generated in the US music industry by format (CD sales, concert ticket sales, vinyl sales, etc.) over 30+ years. Then, I will categorize artists based on their popularity by looking at social media likes/follows, and will collect data on their estimated revenues based on the existing royalty pool structures established by streaming services. | ||
| Outcomes: I want artists to utilize music streaming services to their benefit and see them as a beneficial tool for them as opposed to something that hurts them - prevent exclusives | ||
|
|
||
| build model??? predicting model how to do that? | ||
|
|
||
| - how to present | ||
| - what is going on? | ||
| - argument | ||
|
|
||
| - Do an interactive piece | ||
| Use Plotly, | ||
|
|
||
| Hypothesis - there's something there. How can I make that clear? | ||
| Think through what the logic model is | ||
| What is impacting the increase? Music industry | ||
| Looking at historical data | ||
| every time a streaming service launches there is a spike | ||
|
|
||
| pulling aggregate numbers - why I think what it means |
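
One way to check the "spike" idea from the notes above against historical data is to compute the year-over-year change in aggregate revenue and see which years stand out. The following is only a minimal sketch: the function names, the 10% threshold, the revenue figures, and the launch year are all hypothetical toy values, not real industry data.

```python
def year_over_year_change(revenue_by_year):
    """Return {year: fractional change vs. the previous year in the series}."""
    years = sorted(revenue_by_year)
    changes = {}
    for prev, curr in zip(years, years[1:]):
        before = revenue_by_year[prev]
        changes[curr] = (revenue_by_year[curr] - before) / before
    return changes


def spike_years(revenue_by_year, threshold=0.10):
    """Years whose growth over the previous year exceeds `threshold`.

    The 10% default is an arbitrary assumption for illustration.
    """
    changes = year_over_year_change(revenue_by_year)
    return [year for year, change in changes.items() if change > threshold]


# Toy numbers only: NOT real revenue figures or real launch dates.
toy_revenue = {2010: 100.0, 2011: 95.0, 2012: 96.0, 2013: 120.0, 2014: 118.0}
toy_launch_years = {2013}

spikes = spike_years(toy_revenue)
spikes_at_launch = [year for year in spikes if year in toy_launch_years]
print(spikes, spikes_at_launch)
```

A spike that coincides with a launch year would not by itself show that the launch caused it, so this is a first look at the aggregate numbers rather than a test of the hypothesis.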
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,41 @@ | ||
| Logic Model/Theory of Action (scientific method) | ||
|
|
||
| Input - theory we're collecting, other thoughts about it, data points we had collected, opinions from someone, more qualitative? | ||
| Activity - Activities we care about (maybe artists going to Apple Music and being vocal?) | ||
| Output - What are we counting? What can we actually capture? How granular? (how many time slices?) What meaning is ascribed to it? | ||
| Outcomes - What do we care about? (I want artists to utilize music streaming services to their benefit and see them as a beneficial tool for them as opposed to something that hurts them - prevent exclusives) | ||
|
|
||
| Tidy data | ||
| manageable data | ||
| observations in rows | ||
| variables are in columns | ||
| for 10/6 | ||
|
|
||
| 10/11 | ||
| Visualization | ||
| Three visualizations | ||
| One viz must demonstrate something you learned that relates to your logic model | ||
| R, Plotly, Tableau, excel | ||
|
|
||
| Markdown doc | ||
|
|
||
| Process | ||
| 1. Describe data | ||
| 2. Choose methodology | ||
| 3. Make notes on the purpose of the method | ||
| 4. What data structure does method need? What structure do you have? | ||
| 5. Make notes on the limitations of the method | ||
| 6. Research the code | ||
| 7. Toy data | ||
| 8. What are the expected results? | ||
| 9. Apply the code to your data | ||
|
|
||
| Feature Engineering | ||
| Using domain expertise to reason about what should be included in a model (i.e. classifier) | ||
| very similar to educational statistics | ||
|
|
||
| Construct Validity | ||
| Methodology Statement | ||
| A description of the method you plan to employ to analyze your data | ||
| A list of assumptions, "gotchas" or other considerations you need to be aware of to use that method | ||
| How you plan to validate both model and constructs |
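
The tidy-data rule in the notes above (observations in rows, variables in columns) can be illustrated with a small reshape. This is a sketch using only the Python standard library; the function name `to_tidy`, the column names, and the numbers are hypothetical toy values chosen for illustration.

```python
def to_tidy(wide_rows, id_column="year", variable="format", value="revenue"):
    """Reshape wide rows (one column per format) into tidy rows.

    Each output row is one observation: a single (year, format, revenue) triple.
    """
    tidy = []
    for row in wide_rows:
        for column, cell in row.items():
            if column == id_column:
                continue  # the identifier is repeated on every tidy row instead
            tidy.append({id_column: row[id_column], variable: column, value: cell})
    return tidy


# Toy data only: NOT real revenue figures.
toy_wide = [
    {"year": 2000, "cd": 10.0, "vinyl": 1.0},
    {"year": 2001, "cd": 9.0, "vinyl": 1.5},
]

tidy = to_tidy(toy_wide)
for observation in tidy:
    print(observation)
```

In the tidy form, adding a new format adds rows rather than columns, which is the shape most plotting tools expect.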
This project will hinge on whether you can get royalty info for individual artists (or maybe the proportion of revenue an artist makes over time) and can find a trend. Is this kind of revenue data available?
Because the royalty pool structures are made available to the public (at least Spotify's are), I was thinking of basically plugging in numbers to find royalty information for individual artists. However, I am not sure if I can find data on the proportion of revenue an artist makes over time. I am, however, currently interning at Spotify, so I can do some digging to see whether I can get artist data over time.
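
To make the "plugging in numbers" idea concrete, here is a minimal sketch that assumes a simple pro-rata pool, where each artist's payout is the pool multiplied by their share of total streams. That formula is an assumption for illustration and is not confirmed here to match any service's published structure; the function name, pool size, and stream counts are hypothetical toy values.

```python
def pro_rata_payouts(pool, streams_by_artist):
    """Split `pool` across artists in proportion to their stream counts.

    Assumes a plain pro-rata model; real royalty structures may differ.
    """
    total_streams = sum(streams_by_artist.values())
    if total_streams == 0:
        return {artist: 0.0 for artist in streams_by_artist}
    return {
        artist: pool * streams / total_streams
        for artist, streams in streams_by_artist.items()
    }


# Toy data only: NOT real stream counts or payouts.
toy_streams = {"artist_a": 900, "artist_b": 90, "artist_c": 10}

payouts = pro_rata_payouts(1000.0, toy_streams)
for artist, amount in payouts.items():
    print(artist, round(amount, 2))
```

Running the same function over artists grouped by popularity tier would give the per-category estimates described in the problem statement, subject to the pro-rata assumption.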