glam/.gitignore
kempersc 90a1f20271 chore: add YAML history fix scripts and update ducklake/deploy tooling
- Add fix_yaml_history.py and fix_yaml_history_v2.py for cleaning up
  malformed ghcid_history entries with duplicate/redundant data
- Update load_custodians_to_ducklake.py for DuckDB lakehouse loading
- Update migrate_web_archives.py for web archive management
- Update deploy.sh with improvements
- Ignore entire data/ducklake/ directory (generated databases)
2025-12-07 18:45:52 +01:00

112 lines
1.2 KiB
Text

# Python
__pycache__/
*.py[cod]
*$py.class
*.so
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
/lib/
lib64/
parts/
sdist/
var/
wheels/
share/python-wheels/
*.egg-info/
.installed.cfg
*.egg
MANIFEST
# documents
docs/invoice
# Virtual environments
venv/
env/
ENV/
env.bak/
venv.bak/
.venv/
# Poetry
poetry.lock
.poetry/
# Testing
.tox/
.nox/
.coverage
.coverage.*
.cache
nosetests.xml
coverage.xml
*.cover
*.log
.pytest_cache/
htmlcov/
# Type checking
.mypy_cache/
.dmypy.json
dmypy.json
.pytype/
.pyre/
# IDEs
.vscode/
.idea/
*.swp
*.swo
*~
.DS_Store
# Jupyter
.ipynb_checkpoints/
*.ipynb
# Environment variables
.env
.env.local
.env.*.local
# Documentation
docs/_build/
site/
# spaCy models
*.spacy
*.bin
# Data files (large CSVs, JSONs, etc.)
# Uncomment if you don't want to track large data files
# *.csv
# *.json
# *.parquet
# *.db
# *.sqlite
# Output files
output/
exports/
cache/
# Logs
logs/
*.log
# Infrastructure secrets
infrastructure/terraform/terraform.tfvars
infrastructure/terraform/*.auto.tfvars
infrastructure/terraform/.terraform/
infrastructure/terraform/*.tfstate*
# DuckDB lakehouse databases
data/ducklake/*.duckdb
data/ducklake/*.duckdb.wal
data/ducklake/