Troubleshooting guides and solutions for common issues with the Chisel CLI.

Quick Issue Resolution

Most issues can be resolved quickly by checking authentication, upload size, and GPU memory requirements.
| Issue Type | Quick Check | Common Solution |
| --- | --- | --- |
| Authentication | chisel.auth.is_authenticated() | Re-authenticate with chisel command |
| Upload fails | Directory size < 100MB | Add files to .gitignore |
| GPU memory | Reduce batch size | Use larger GPU type |
| Script not found | Script in upload directory | Check upload_dir parameter |
| Network issues | Check connectivity | Verify CHISEL_BACKEND_URL |
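
The "Upload fails" check can be done before submitting a job. The following is a minimal sketch using only the Python standard library, not part of Chisel itself. The helper names are hypothetical, and it assumes the 100MB limit counts every regular file in the directory with 1MB taken as 1024 * 1024 bytes. Chisel may measure the upload differently, for example by skipping files listed in .gitignore.

```python
import os

# 100MB threshold from the quick-check table.
# Assumption: 1MB is treated as 1024 * 1024 bytes.
MAX_UPLOAD_BYTES = 100 * 1024 * 1024


def file_sizes(path):
    """Yield (size_in_bytes, file_path) for every regular file under path."""
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            # Skip symlinks so targets outside the tree are not counted.
            if not os.path.islink(file_path):
                yield os.path.getsize(file_path), file_path


def directory_size_bytes(path):
    """Return the combined size of all regular files under path."""
    return sum(size for size, _ in file_sizes(path))


def largest_files(path, count=5):
    """Return up to count (size, path) pairs, largest first."""
    return sorted(file_sizes(path), reverse=True)[:count]


def is_under_limit(path, limit=MAX_UPLOAD_BYTES):
    """Return True when the directory is smaller than the limit."""
    return directory_size_bytes(path) < limit
```

If `is_under_limit` returns False for your upload directory, `largest_files` shows which files are the best candidates to exclude through .gitignore.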

Installation Issues

Authentication Issues

Runtime Issues

Performance Issues

Network and Connectivity

Debugging Tools

Common Error Patterns

Quick reference for the most common error patterns and their solutions:
| Error Message | Cause | Quick Fix |
| --- | --- | --- |
| ModuleNotFoundError: No module named 'chisel' | Not installed | pip install chisel-cli |
| CUDA out of memory | Insufficient GPU memory | Reduce batch size or use larger GPU |
| Script not found | Wrong upload directory | Check upload_dir parameter |
| Authentication failed | Invalid credentials | clear_credentials() and re-authenticate |
| Connection timeout | Network issues | Check connectivity and backend URL |
| Work upload failed | Directory too large | Add files to .gitignore |
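
For scripted triage, the error patterns above can be kept as a small lookup. This is only an illustrative sketch: `QUICK_FIXES` and `suggest_fix` are hypothetical names, and case-insensitive substring matching is an assumption about how these messages appear in real output. The patterns and fixes themselves are copied from the quick reference.

```python
# Patterns and quick fixes copied from the quick reference above.
# Assumption: a case-insensitive substring match is enough to identify each error.
QUICK_FIXES = [
    ("No module named 'chisel'", "pip install chisel-cli"),
    ("CUDA out of memory", "Reduce batch size or use larger GPU"),
    ("Script not found", "Check upload_dir parameter"),
    ("Authentication failed", "clear_credentials() and re-authenticate"),
    ("Connection timeout", "Check connectivity and backend URL"),
    ("Work upload failed", "Add files to .gitignore"),
]


def suggest_fix(error_text):
    """Return the quick fix for the first matching pattern, or None."""
    lowered = error_text.lower()
    for pattern, fix in QUICK_FIXES:
        if pattern.lower() in lowered:
            return fix
    return None
```

Passing the last line of a traceback to `suggest_fix` returns the matching quick fix, or None when the error is not one of the listed patterns.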

Getting Help

If you’re still experiencing issues after trying these solutions, gather the following information before seeking help:

Information to Collect

  1. System Information (use the debug script above)
  2. Complete Error Output (full traceback)
  3. Minimal Reproduction Code (smallest example that fails)
  4. Job ID (if applicable from Chisel output)
  5. Network Environment (corporate, home, cloud, etc.)
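
The environment details in item 1 can be gathered with the Python standard library alone. The sketch below is one way to do it and is not the official debug script. The function names are hypothetical, and it assumes the installed distributions are named `torch` and `chisel-cli` (the latter taken from the `pip install chisel-cli` command in the error table).

```python
import platform
from importlib import metadata


def package_version(dist_name):
    """Return the installed version of a distribution, or 'not installed'."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return "not installed"


def collect_environment():
    """Collect the fields used in the Environment section of a report."""
    return {
        "OS": f"{platform.system()} {platform.release()}",
        "Python": platform.python_version(),
        # Assumption: PyTorch is installed under the distribution name "torch".
        "PyTorch": package_version("torch"),
        # Assumption: the CLI is installed under the distribution name "chisel-cli".
        "Chisel CLI": package_version("chisel-cli"),
    }


def format_environment(info):
    """Render the collected fields as a bulleted list for pasting into a report."""
    return "\n".join(f"- {key}: {value}" for key, value in info.items())
```

Calling `print(format_environment(collect_environment()))` prints lines that can be pasted under the Environment heading of an issue report.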

Support Channels

Reporting Template

When reporting issues, use this template:
## Issue Description
Brief description of the problem

## Environment
- OS: [e.g., Ubuntu 22.04, macOS 13.0, Windows 11]
- Python: [e.g., 3.9.7]
- PyTorch: [e.g., 2.0.1]
- Chisel CLI: [version or commit hash]

## Steps to Reproduce
1. Step one
2. Step two
3. Step three

## Expected Behavior
What you expected to happen

## Actual Behavior
What actually happened

## Error Output
Complete error message and traceback

## Additional Context
- Job ID (if applicable)
- Network environment
- Any workarounds tried