Author: ctr1 (【積π】)
Board: Python
Title: [Question] Scraping the arguments carried inside onclick
Time: Thu Feb 20 10:00:37 2020
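A question for the scraping veterans on this board:
how can I pull out each of the values inside tr > onclick="goDetail()" separately,
i.e. the parts marked in red in the output below?
Thanks in advance, scraping veterans.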
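--- Code ---
import requests
from bs4 import BeautifulSoup

url = '(omitted)'     # left out by the poster
params = '(omitted)'  # POST form data, also left out
html = requests.post(url, params)
soup = BeautifulSoup(html.text, 'html.parser')
# Select the <tbody> of the results table under #result
result = soup.select_one('div #result tbody')
print(result)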
[out:]
<tbody>
<tr class="td_odd" onclick="goDetail(
'101','70-1-44-1(035)')"
style="cursor:pointer;">
<td align="center" valign="middle">1</td>
<td align="center" class="td-line" valign="middle">效期內</td>
</tr>
<tr class="td_event" onclick="goDetail(
'301','40-107-4-1(9918)')"
style="cursor:pointer;">
<td align="center" valign="middle">2</td>
<td align="center" class="td-line" valign="middle">效期內</td>
</tr>
<tr class="td_odd" onclick="goDetail(
'502','53-1-42-1(531)')"
style="cursor:pointer;">
<td align="center" valign="middle">1</td>
<td align="center" class="td-line" valign="middle">效期內</td>
</tr>
</tbody>
--
※ Posted via: PTT (ptt.cc), from: 210.242.44.65 (Taiwan)
※ Article URL: https://www.ptt.cc/bbs/Python/M.1582164039.A.948.html
→ vi000246: Brute-force it with a regular expression 02/20 10:46
Push yushes920179: Pull that string out first, then run re on it 02/20 17:05
→ froce: 5066ea9f/?m=Saved%20fiddle 02/20 19:04
→ froce: Why brute-force it? onclick is just another attr 02/20 19:05
Push jiyu520: Grab the attribute outerHTML 02/21 03:22
→ jiyu520: Grab the attribute, or just look at outerHTML directly 02/21 03:23
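
Putting the suggestions above together (treat onclick as an ordinary attribute, then run re on that string), a minimal sketch might look like this. The sample markup is trimmed from the output pasted in the post. The names extract_godetail_args and ARG_PATTERN are made up for illustration, and the pattern assumes every argument is a single-quoted string with no quotes inside it.

```python
import re
from bs4 import BeautifulSoup

# Sample markup trimmed from the output in the post (two rows only).
html = """
<tbody>
<tr class="td_odd" onclick="goDetail(
'101','70-1-44-1(035)')"
style="cursor:pointer;">
<td align="center" valign="middle">1</td>
</tr>
<tr class="td_event" onclick="goDetail(
'301','40-107-4-1(9918)')"
style="cursor:pointer;">
<td align="center" valign="middle">2</td>
</tr>
</tbody>
"""

soup = BeautifulSoup(html, "html.parser")

# Hypothetical pattern: captures the text between each pair of single quotes.
# Assumes the arguments never contain a single quote themselves.
ARG_PATTERN = re.compile(r"'([^']*)'")


def extract_godetail_args(node):
    """Return one tuple of string arguments per <tr> that carries an onclick."""
    rows = []
    for tr in node.select("tr[onclick]"):
        onclick = tr["onclick"]  # read like any other attribute
        rows.append(tuple(ARG_PATTERN.findall(onclick)))
    return rows


args = extract_godetail_args(soup)
print(args)
# [('101', '70-1-44-1(035)'), ('301', '40-107-4-1(9918)')]
```

On the real page, passing result from the original code in place of soup should behave the same way, assuming the live markup matches what was pasted.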