fix: docker exec -i fuer Heredoc-Stdin in Auto-Ingest

Bug: Index-Scrape-Bloecke fuer HH/HE/SH produzierten 0 Protokolle, weil
\`docker exec ... python <<EOF\` ohne -i den Heredoc-Stdin nicht an
den Container weiterleitet.

Symptom in /tmp/aip.log:
  --- HH WP23 (Index-Scrape) ---
  --- HE WP21 (Index-Scrape) ---
  --- SH WP20 (Index-Scrape) ---
  (keine Output-Zeilen, exit 0)

Fix: docker exec -i an allen 3 Stellen.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
Dotty Dotter 2026-04-29 01:37:42 +02:00
parent 7ebdc78331
commit 329c6e25e5

View File

@ -40,7 +40,7 @@ echo "=== auto-ingest-protocols $(date -Iseconds) ==="
# Hamburg hat keine vorhersagbare URL-Pattern (Blob-IDs + Hashes).
# Stattdessen: Index-Seite scrapen, jedes gefundene PDF einzeln ingesten.
echo "--- HH WP23 (Index-Scrape) ---"
docker exec "$CONTAINER" python <<EOF
docker exec -i "$CONTAINER" python <<EOF
import re, sys
import urllib.request
import sqlite3
@ -102,7 +102,7 @@ EOF
# Hessen-Beschlussprotokoll-URL-Pattern hat Datum-Anteil DD-MM-YYYY,
# daher Pattern via starweb-Index extrahieren statt vorzuhersagen.
echo "--- HE WP21 (Index-Scrape) ---"
docker exec "$CONTAINER" python <<'EOF'
docker exec -i "$CONTAINER" python <<'EOF'
import re, sys
import urllib.request
import sqlite3
@ -162,7 +162,7 @@ EOF
# ─── SH: Index-Page-Scrape (URL enthaelt Jahr + MM-YY-Suffix) ─────────
# SH-URL hat Jahr-Pfad + MM-YY-Suffix, daher Index-Scrape.
echo "--- SH WP20 (Index-Scrape) ---"
docker exec "$CONTAINER" python <<'EOF'
docker exec -i "$CONTAINER" python <<'EOF'
import re, sys
import urllib.request
import sqlite3