Add how-to guide to analyse company coverage by oscar-si · Pull Request #9 · Bigdata-com/bigdata-docs-resources

oscar-si · 2026-05-22T10:26:31Z

No description provided.

ddeaguilar · 2026-05-25T09:30:35Z

+
+## Prerequisites
+
+- Python 3.7+


Even if the script works with a previous version, I think we should encourage people to use a more recent version and include that here. Something like:

Python 3.7+ (We recommend 3.10+)

Or directly mention as a prerequisite Python 3.10+. Otherwise it looks like this script is outdated.

ddeaguilar · 2026-05-25T10:06:23Z

@@ -0,0 +1,7 @@
+name,mic,ticker,isin,cusip,sedol,ravenpack_id,country,industry
+Micron Technology Inc.,,,US5951121038,,,49BBBC,US,Semiconductors


We can replace these input files with only one

name,ravenpack_id Apple Inc.,4A6F00 NVIDIA Corporation,52258B `python

ddeaguilar · 2026-05-25T10:07:03Z

+Micron Technology Inc.,,,US5951121038,,,49BBBC,US,Semiconductors
+NVIDIA Corporation,,,,,2379504,E09E2B,US,Semiconductors
+Figma Inc.,,,,316841105,,BA9E0C,US,Internet Services
+Microsoft Corporation,XNAS,MSFT,,,,228D42,US,Software


Also this script doesn't care about public vs private companies so I would rename the default input file to just company_ids.csv and make a note saying that you can pass the files from the other how to guide

ddeaguilar · 2026-05-25T10:08:39Z

+
+
+def _coverage_fields(label: str) -> tuple[str, str]:
+    return f"distinct_documents_{label}", f"distinct_chunks_{label}"


It ends up being a very long name, what do you think about:

distinct_documents_XXX -> docs_XXX
distinct_chunks_XXX -> chunks_XXX

We can still mention the distinct word in the README

ddeaguilar · 2026-05-25T10:22:10Z

+```csv
+name,ravenpack_id
+Apple Inc.,4A6F00
+NVIDIA Corporation,52258B


This is not nvidia ID, the RP id is: E09E2B

Add how-to guide to analyse company coverage

420a5ae

oscar-si requested a review from ddeaguilar May 22, 2026 10:26

oscar-si and others added 2 commits May 25, 2026 11:22

Add an example private_company_ids.csv

cd0fa9a

Handling errors on volume endpoint and lowering fields

470b78d

ddeaguilar reviewed May 25, 2026

View reviewed changes

Tune README.md with comments

ea65fa8

oscar-si merged commit aa55d61 into main May 25, 2026

oscar-si deleted the company-coverage branch May 25, 2026 10:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add how-to guide to analyse company coverage#9

Add how-to guide to analyse company coverage#9
oscar-si merged 4 commits into
mainfrom
company-coverage

oscar-si commented May 22, 2026

Uh oh!

ddeaguilar May 25, 2026

Uh oh!

ddeaguilar May 25, 2026

Uh oh!

ddeaguilar May 25, 2026

Uh oh!

ddeaguilar May 25, 2026

Uh oh!

ddeaguilar May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1,7 @@
		name,mic,ticker,isin,cusip,sedol,ravenpack_id,country,industry
		Micron Technology Inc.,,,US5951121038,,,49BBBC,US,Semiconductors



		def _coverage_fields(label: str) -> tuple[str, str]:
		return f"distinct_documents_{label}", f"distinct_chunks_{label}"

Conversation

oscar-si commented May 22, 2026

Uh oh!

ddeaguilar May 25, 2026

Choose a reason for hiding this comment

Uh oh!

ddeaguilar May 25, 2026

Choose a reason for hiding this comment

Uh oh!

ddeaguilar May 25, 2026

Choose a reason for hiding this comment

Uh oh!

ddeaguilar May 25, 2026

Choose a reason for hiding this comment

Uh oh!

ddeaguilar May 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants