Beberapa halaman merender dua atau lebih CAPTCHA — form login dengan reCAPTCHA dan pendaftaran newsletter dengan reCAPTCHA kedua, atau form multi-step yang setiap langkahnya memicu challenge tersendiri. Setiap CAPTCHA memiliki sitekey dan target element yang unik, jadi Anda perlu mendeteksi semuanya, solve secara paralel, dan inject setiap token ke callback yang benar.
Mendeteksi Beberapa CAPTCHA
Python dengan Selenium
from selenium import webdriver
from selenium.webdriver.common.by import By
driver = webdriver.Chrome()
driver.get("https://example.com/multi-captcha-page")
# Find all reCAPTCHA iframes
captcha_iframes = driver.find_elements(
By.CSS_SELECTOR, 'iframe[src*="recaptcha/api2/anchor"]'
)
# Extract sitekeys from each iframe's src
import re
captchas = []
for i, iframe in enumerate(captcha_iframes):
src = iframe.get_attribute("src")
match = re.search(r"k=([A-Za-z0-9_-]+)", src)
if match:
captchas.append({
"index": i,
"sitekey": match.group(1),
"iframe": iframe
})
print(f"Found {len(captchas)} CAPTCHAs on page")
for c in captchas:
print(f" [{c['index']}] sitekey: {c['sitekey']}")
Hasil yang diharapkan:
Found 2 CAPTCHAs on page
[0] sitekey: 6LcXyzABCDEF-login
[1] sitekey: 6LcAbcDEFGHI-signup
JavaScript dengan Puppeteer
const puppeteer = require('puppeteer');
(async () => {
const browser = await puppeteer.launch({ headless: false });
const page = await browser.newPage();
await page.goto('https://example.com/multi-captcha-page', {
waitUntil: 'networkidle2'
});
// Extract all reCAPTCHA widgets from the page
const captchas = await page.evaluate(() => {
const widgets = document.querySelectorAll('.g-recaptcha');
return Array.from(widgets).map((el, i) => ({
index: i,
sitekey: el.getAttribute('data-sitekey'),
elementId: el.id || `captcha-${i}`,
callbackName: el.getAttribute('data-callback') || null
}));
});
console.log(`Found ${captchas.length} CAPTCHAs`);
captchas.forEach(c => console.log(` [${c.index}] ${c.sitekey}`));
})();
Solve Semua CAPTCHA secara Paralel
Kirim semua CAPTCHA ke CaptchaAI sekaligus, lalu polling sampai masing-masing selesai.
Python
import requests
import time
from concurrent.futures import ThreadPoolExecutor, as_completed
API_KEY = "YOUR_API_KEY"
PAGE_URL = "https://example.com/multi-captcha-page"
def submit_captcha(sitekey):
resp = requests.post("https://ocr.captchaai.com/in.php", data={
"key": API_KEY,
"method": "userrecaptcha",
"googlekey": sitekey,
"pageurl": PAGE_URL,
"json": "1"
})
result = resp.json()
if result["status"] != 1:
raise Exception(f"Submit error: {result['request']}")
return result["request"]
def poll_result(task_id, timeout=120):
deadline = time.time() + timeout
while time.time() < deadline:
time.sleep(5)
resp = requests.get("https://ocr.captchaai.com/res.php", params={
"key": API_KEY,
"action": "get",
"id": task_id,
"json": "1"
})
result = resp.json()
if result["status"] == 1:
return result["request"]
if result["request"] != "CAPCHA_NOT_READY":
raise Exception(f"Poll error: {result['request']}")
raise TimeoutError(f"Task {task_id} timed out")
def solve_all(captchas):
# Submit all in parallel
task_ids = {}
with ThreadPoolExecutor(max_workers=len(captchas)) as pool:
futures = {
pool.submit(submit_captcha, c["sitekey"]): c["index"]
for c in captchas
}
for future in as_completed(futures):
idx = futures[future]
task_ids[idx] = future.result()
print(f"[captcha-{idx}] Submitted → task {task_ids[idx]}")
# Poll all in parallel
tokens = {}
with ThreadPoolExecutor(max_workers=len(task_ids)) as pool:
futures = {
pool.submit(poll_result, tid): idx
for idx, tid in task_ids.items()
}
for future in as_completed(futures):
idx = futures[future]
tokens[idx] = future.result()
print(f"[captcha-{idx}] Solved")
return tokens
# Example usage
captchas = [
{"index": 0, "sitekey": "6LcXyzABCDEF-login"},
{"index": 1, "sitekey": "6LcAbcDEFGHI-signup"}
]
tokens = solve_all(captchas)
Hasil yang diharapkan:
[captcha-0] Submitted → task 71823456
[captcha-1] Submitted → task 71823457
[captcha-1] Solved
[captcha-0] Solved
Inject Token ke Widget yang Benar
Setiap widget reCAPTCHA memiliki textarea g-recaptcha-response sendiri. Jika ada beberapa widget, setiap textarea bersarang di dalam container widget-nya.
Python (Selenium)
def inject_tokens(driver, captchas, tokens):
for c in captchas:
idx = c["index"]
token = tokens[idx]
# Find the textarea within the widget's container
container = driver.find_elements(By.CSS_SELECTOR, ".g-recaptcha")[idx]
textarea = container.find_element(
By.CSS_SELECTOR, 'textarea[name="g-recaptcha-response"]'
)
driver.execute_script(
"arguments[0].value = arguments[1];", textarea, token
)
# Trigger the callback if defined
callback = c.get("callback")
if callback:
driver.execute_script(f"{callback}('{token}');")
print(f"[captcha-{idx}] Token injected")
inject_tokens(driver, captchas, tokens)
JavaScript (Puppeteer)
async function injectTokens(page, captchas, tokens) {
for (const captcha of captchas) {
const token = tokens[captcha.index];
await page.evaluate((idx, tkn, callbackName) => {
const widgets = document.querySelectorAll('.g-recaptcha');
const textarea = widgets[idx].querySelector(
'textarea[name="g-recaptcha-response"]'
);
textarea.value = tkn;
if (callbackName && typeof window[callbackName] === 'function') {
window[callbackName](tkn);
}
}, captcha.index, token, captcha.callbackName);
console.log(`[captcha-${captcha.index}] Token injected`);
}
}
await injectTokens(page, captchas, tokens);
CAPTCHA Tipe Campuran
Jika halaman memiliki tipe CAPTCHA berbeda (misalnya reCAPTCHA + Turnstile), deteksi setiap tipe secara terpisah:
def detect_all_captchas(driver):
detected = []
# reCAPTCHA
recaptchas = driver.find_elements(By.CSS_SELECTOR, ".g-recaptcha")
for i, el in enumerate(recaptchas):
detected.append({
"type": "userrecaptcha",
"sitekey": el.get_attribute("data-sitekey"),
"label": f"recaptcha-{i}"
})
# Turnstile
turnstiles = driver.find_elements(By.CSS_SELECTOR, ".cf-turnstile")
for i, el in enumerate(turnstiles):
detected.append({
"type": "turnstile",
"sitekey": el.get_attribute("data-sitekey"),
"label": f"turnstile-{i}"
})
return detected
Kirim masing-masing dengan parameter method yang sesuai — userrecaptcha untuk reCAPTCHA, turnstile untuk Turnstile.
Pemecahan Masalah
| Masalah | Penyebab | Solusi |
|---|---|---|
| Token di-inject tapi form masih diblokir | Callback tidak dipanggil | Periksa data-callback dan panggil dengan token |
| Hanya CAPTCHA pertama yang terdeteksi | CAPTCHA kedua dimuat lambat | Tunggu semua iframe/widget muncul sebelum scan |
| Token salah di widget yang salah | Index mismatch | Map token ke sitekey, bukan posisi index |
ERROR_WRONG_GOOGLEKEY |
Sitekey diekstrak salah | Verifikasi sitekey dari atribut src iframe atau data-sitekey |
Pertanyaan Umum
Bisakah satu halaman memiliki sitekey berbeda untuk setiap CAPTCHA?
Ya. Setiap widget dapat menggunakan sitekey terpisah. Selalu ekstrak sitekey per widget, jangan asumsikan mereka berbagi satu sitekey.
Haruskah saya solve secara berurutan atau paralel?
Secara paralel. Setiap solve membutuhkan 15–30 detik. Solve dua CAPTCHA secara paralel membutuhkan waktu sama dengan solve satu.
Bagaimana jika CAPTCHA kedua hanya muncul setelah submit form pertama?
Scan ulang halaman setelah setiap form submit untuk mendeteksi CAPTCHA yang baru dirender. Gunakan WebDriverWait (Selenium) atau page.waitForSelector (Puppeteer) untuk menunggu widget baru.
Panduan Terkait
- Ekstraksi CAPTCHA Iframe: Solve CAPTCHA dalam Frame Bersarang
- Mengekstrak Parameter reCAPTCHA dari Source Halaman