Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] - cardano-testnet sometimes hangs indefinitely #5762

Open
carbolymer opened this issue Apr 8, 2024 · 5 comments
Open

[BUG] - cardano-testnet sometimes hangs indefinitely #5762

carbolymer opened this issue Apr 8, 2024 · 5 comments
Assignees
Labels
priority: high Needs to be addressed as soon as possible, probably within current sprint. type: bug Something is not working

Comments

@carbolymer
Copy link
Contributor

carbolymer commented Apr 8, 2024

Internal/External
Internal if an IOHK staff member.

Area

Other Any other topic (Delegation, Ranking, ...).

Summary
Sometimes a cardano-testnet test suite hangs indefinitely. It's like nodes are taking longer time to produce blocks. It may be related to what @james-iohk described here #5679 (comment)

The issue is more visible in slower machines, like macos runner in GHA or darwin cross-compilation in Hydra.

Steps to reproduce
Steps to reproduce the behavior:

  1. Rerun the cardano-test suite multiple times, some of the tests should either get stuck or fail on a condition check in byDeadlineM.

The issue appears to appear more frequently when running testnet test suites in parallel.

Note

Testnet tests can be executed in parallel using PARALLEL_TESTNETS=1 environment variable or by setting --test-options '--num-threads 8' in cabal test cardano-testnet execution (after that PR gets merged).

Sample log of a failure: babbagetransaction.txt (taken from: https://github.com/IntersectMBO/cardano-node/pull/5695/checks?check_run_id=22357754517)

Expected behavior
cardano-testnet does not hang, or retries, reports the failure with message explaining what happened.

@carbolymer carbolymer added the type: bug Something is not working label Apr 8, 2024
@carbolymer
Copy link
Contributor Author

carbolymer commented Apr 8, 2024

Initially, byDeadlineM usage was considered an issue here, which was partially removed in #5707 - but instead of test failures we started getting cardano-tesnet freezes.
A suspicion here is that the test network is not advancing - the new blocks are not produced.

@smelc smelc added the priority: high Needs to be addressed as soon as possible, probably within current sprint. label Apr 9, 2024
@newhoggy newhoggy self-assigned this Apr 10, 2024
@carbolymer carbolymer mentioned this issue Apr 15, 2024
9 tasks

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

@github-actions github-actions bot added the Stale label Jul 20, 2024
@carbolymer carbolymer removed the Stale label Jul 22, 2024
@carbolymer carbolymer self-assigned this Sep 16, 2024
@carbolymer
Copy link
Contributor Author

carbolymer commented Sep 16, 2024

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
priority: high Needs to be addressed as soon as possible, probably within current sprint. type: bug Something is not working
Projects
None yet
Development

No branches or pull requests

3 participants