fix: docker exec -i fuer Heredoc-Stdin in Auto-Ingest
Bug: Index-Scrape-Bloecke fuer HH/HE/SH produzierten 0 Protokolle, weil \`docker exec ... python <<EOF\` ohne -i den Heredoc-Stdin nicht an den Container weiterleitet. Symptom in /tmp/aip.log: --- HH WP23 (Index-Scrape) --- --- HE WP21 (Index-Scrape) --- --- SH WP20 (Index-Scrape) --- (keine Output-Zeilen, exit 0) Fix: docker exec -i an allen 3 Stellen. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
parent
7ebdc78331
commit
329c6e25e5
@ -40,7 +40,7 @@ echo "=== auto-ingest-protocols $(date -Iseconds) ==="
|
||||
# Hamburg hat keine vorhersagbare URL-Pattern (Blob-IDs + Hashes).
|
||||
# Stattdessen: Index-Seite scrapen, jedes gefundene PDF einzeln ingesten.
|
||||
echo "--- HH WP23 (Index-Scrape) ---"
|
||||
docker exec "$CONTAINER" python <<EOF
|
||||
docker exec -i "$CONTAINER" python <<EOF
|
||||
import re, sys
|
||||
import urllib.request
|
||||
import sqlite3
|
||||
@ -102,7 +102,7 @@ EOF
|
||||
# Hessen-Beschlussprotokoll-URL-Pattern hat Datum-Anteil DD-MM-YYYY,
|
||||
# daher Pattern via starweb-Index extrahieren statt vorzuhersagen.
|
||||
echo "--- HE WP21 (Index-Scrape) ---"
|
||||
docker exec "$CONTAINER" python <<'EOF'
|
||||
docker exec -i "$CONTAINER" python <<'EOF'
|
||||
import re, sys
|
||||
import urllib.request
|
||||
import sqlite3
|
||||
@ -162,7 +162,7 @@ EOF
|
||||
# ─── SH: Index-Page-Scrape (URL enthaelt Jahr + MM-YY-Suffix) ─────────
|
||||
# SH-URL hat Jahr-Pfad + MM-YY-Suffix, daher Index-Scrape.
|
||||
echo "--- SH WP20 (Index-Scrape) ---"
|
||||
docker exec "$CONTAINER" python <<'EOF'
|
||||
docker exec -i "$CONTAINER" python <<'EOF'
|
||||
import re, sys
|
||||
import urllib.request
|
||||
import sqlite3
|
||||
|
||||
Loading…
Reference in New Issue
Block a user