Initial stab at facilitating /proc/stat sampling#755
Initial stab at facilitating /proc/stat sampling#755oschaaf wants to merge 2 commits intoistio:masterfrom
Conversation
See README.md for contents Signed-off-by: Otto van der Schaaf <oschaaf@we-amp.com>
Signed-off-by: Otto van der Schaaf <oschaaf@we-amp.com>
|
Hi @oschaaf. Thanks for your PR. I'm waiting for a istio member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
howardjohn
left a comment
There was a problem hiding this comment.
Sorry I am not too familiar with this, just wondering how it compares/relates to https://github.com/prometheus/node_exporter?
|
Node exporter looks very promising. It has great coverage. . @oschaaf if it supports higher sampling rate, ability to run on vms and baremetal, then we have full coverage. |
|
Looking at node exporter, this is similar, yet different. Summarizing findings after a short exploration:
I might be overly paranoid, but on the one hand, to me it seems that doing less means less (chance of) introducing noise into the actual measurements we're doing. And also directly streaming collected samples to storage seems like a good way to make sure all captured information is retained for future reference (or at least know about it when something is lost). |
|
@oschaaf apologies for delayed response. I think that using an existing tool is preferred until it is proven to be an issue, like excessive cost of sampling. It would be better to send PRs to nodeexporter if excessive re-opening of files becomes an issue. I think we should document this work, and re-open it in the future if nodeexporter is unworkable. |
|
Closing this one, as instead #890 will bring up |
This contains a first stab at a service which can be deployed to containers to sample proc stat, and expose results in Prometheus format over http. Creating an early PR for discussion.
This also has a shell script that will build a standalone binary for the python service, and deploy/start that in the containers that run the istio-injected side car proxies. On top of that there's a few bits and pieces that might come around useful later, if we want to stream the raw time-series data to disk for later inspection.
Please see README.md for more PR details.