Skip to content

Add how-to guide to analyse company coverage#9

Merged
oscar-si merged 4 commits into
mainfrom
company-coverage
May 25, 2026
Merged

Add how-to guide to analyse company coverage#9
oscar-si merged 4 commits into
mainfrom
company-coverage

Conversation

@oscar-si
Copy link
Copy Markdown
Contributor

No description provided.

@oscar-si oscar-si requested a review from ddeaguilar May 22, 2026 10:26

## Prerequisites

- Python 3.7+
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Even if the script works with a previous version, I think we should encourage people to use a more recent version and include that here. Something like:

  • Python 3.7+ (We recommend 3.10+)

Or directly mention as a prerequisite Python 3.10+. Otherwise it looks like this script is outdated.

@@ -0,0 +1,7 @@
name,mic,ticker,isin,cusip,sedol,ravenpack_id,country,industry
Micron Technology Inc.,,,US5951121038,,,49BBBC,US,Semiconductors
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can replace these input files with only one

name,ravenpack_id
Apple Inc.,4A6F00
NVIDIA Corporation,52258B

`python

Micron Technology Inc.,,,US5951121038,,,49BBBC,US,Semiconductors
NVIDIA Corporation,,,,,2379504,E09E2B,US,Semiconductors
Figma Inc.,,,,316841105,,BA9E0C,US,Internet Services
Microsoft Corporation,XNAS,MSFT,,,,228D42,US,Software
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also this script doesn't care about public vs private companies so I would rename the default input file to just company_ids.csv and make a note saying that you can pass the files from the other how to guide



def _coverage_fields(label: str) -> tuple[str, str]:
return f"distinct_documents_{label}", f"distinct_chunks_{label}"
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It ends up being a very long name, what do you think about:

distinct_documents_XXX -> docs_XXX
distinct_chunks_XXX -> chunks_XXX

We can still mention the distinct word in the README

```csv
name,ravenpack_id
Apple Inc.,4A6F00
NVIDIA Corporation,52258B
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is not nvidia ID, the RP id is: E09E2B

@oscar-si oscar-si merged commit aa55d61 into main May 25, 2026
@oscar-si oscar-si deleted the company-coverage branch May 25, 2026 10:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants