Skip to content

Data for ml factor analysis #2

@MislavSag

Description

@MislavSag

I started to read your book. I have finished chapters 1 and 2. In the book, you use the following data:

This dataset comprises information on 1,207 stocks listed in the US (possibly originating from Canada or Mexico). The time range starts in November 1998 and ends in March 2019. For each point in time, 93 characteristics describe the firms in the sample.

This data is anonymous. We don't know which stock is represented by id.

It would be very helpful if you can give some tips in the book how to get data in the first place. It would be very helpful for beginners in ml factor analysis (like me) who don't have data yet. This would be the first step if we would like to follow you analysis with real stocks.

I even have subscriptions on interactive brokers, but they don't have data on quarterly financial statements, only annual financial statements.

In nutshell, do you have any suggestions on how to obtain good data for ml factor analysis (good quality, as cheap as possible, as older as possible)?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions