Add GPU Compute Error tracking to API #150

Open: wants to merge 1 commit into base: dev

@alferz commented Jul 26, 2018

This commit adds a tracking mechanism that counts GPU Compute Errors per thread. This could be useful for automated scripts and monitoring setups, since a rising number of GPU Compute Errors is an early indicator of impending failure and is worth tracking and alerting on. API output will include a new "results" member, "gpuComputeErrors", as shown below. Each key is the number of a thread that has had failures, and each value is the number of failures on that thread.

"gpuComputeErrors": {
"1": 1,
"2": 2,
"3": 1,
"4": 2,
"5": 1,
"6": 4,
"7": 4
}
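
For the monitoring use case, a script consuming this output might look like the minimal sketch below. It is illustrative only and not part of this commit: the nesting of "gpuComputeErrors" under "results", the sample response, the function names, and the threshold of 3 are all assumptions, and fetching the JSON from the API is left out.

```python
import json

# Hypothetical API response. Nesting "gpuComputeErrors" under "results"
# is an assumption; adjust the lookup to match the real output.
SAMPLE_RESPONSE = """
{
  "results": {
    "gpuComputeErrors": {"1": 1, "2": 2, "3": 1, "4": 2, "5": 1, "6": 4, "7": 4}
  }
}
"""


def parse_gpu_errors(response_text):
    """Return {thread_number: error_count} from the API response text."""
    data = json.loads(response_text)
    errors = data.get("results", {}).get("gpuComputeErrors", {})
    # JSON object keys are strings, so convert thread numbers to int.
    return {int(thread): count for thread, count in errors.items()}


def threads_over_threshold(errors, threshold=3):
    """Return only the threads whose error count is at or above threshold."""
    return {t: n for t, n in errors.items() if n >= threshold}


if __name__ == "__main__":
    errors = parse_gpu_errors(SAMPLE_RESPONSE)
    flagged = threads_over_threshold(errors)
    print("total errors:", sum(errors.values()))
    for thread, count in sorted(flagged.items()):
        print(f"ALERT: thread {thread} has {count} GPU compute errors")
```

Because a thread only appears in the object once it has failed, a missing key can be treated as zero errors for that thread.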
