How to access dynamic HTML elements via web scraping?-Golang-php.cn

Home

Backend Development

Golang

How to access dynamic HTML elements via web scraping?

王林

Feb 09, 2024 am 09:51 AM

html element

如何通过网页抓取访问动态 HTML 元素？

php editor Xiaoxin here introduces a method to access dynamic HTML elements through web crawling. When we crawl web pages, we sometimes encounter dynamically generated content that cannot be obtained directly until the web page is loaded. Fortunately, there are tools and techniques we can use to solve this problem. This article will introduce a PHP-based method that can be used to easily crawl and access dynamic HTML elements. Let’s take a look!

Question content

I am using go-rod for web scraping. I want to access links within dynamic <a></a>. To make this a visible, I have to complete a searcher which is an input with the next format (without submit):

<form>
    <input> <!--this is the searcher-->
<form/>

So, when I'm done, the a I want to access appears:

Up to here, everything is fine. This is the code I use to complete the searcher:

//page's url
page := rod.new().mustconnect().mustpage("https://www.sofascore.com/")

//acept cookies alert
page.mustelement("cookiesalertselector...").mustclick()

//completes the searcher
el := page.mustelement(`searcherselector...`)
el.mustinput("lionel messi")

Now the problem arises, when I want to click on the a that appears after completing the search.

I tried this:

diviwant := page.mustelement("aselector...")
diviwant.mustclick()

and this:

diviwant := page.mustelement("aselector...").mustwaitvisible()
diviwant.mustclick()

However, they all return me the same error:

panic: {-32000 node is detached from document }
goroutine 1 [running]:
github.com/go-rod/rod/lib/utils.glob..func2({0x100742dc0?,
0x140002bad50?})
/users/lucastomicbenitez/go/pkg/mod/github.com/go-rod/[email&#160;protected]/lib/utils/utils.go:65
+0x24 github.com/go-rod/rod.gene.func1({0x14000281ca0?, 0x1003a98b7?, 0x4?})
/users/lucastomicbenitez/go/pkg/mod/github.com/go-rod/[email&#160;protected]/must.go:36
+0x64 github.com/go-rod/rod.(*element).mustclick(0x14000289320)   /users/lucastomicbenitez/go/pkg/mod/github.com/go-rod/[email&#160;protected]/must.go:729
+0x9c main.main()     /users/lucastomicbenitez/development/golang/evolutionaryalgorithm/main/main.go:22
+0x9c exit status 2

So, while looking for some solutions, I found this github issue and tried to get the link via this method:

link := page.musteval(`()=> document.queryselector('aselector...').href`)

But it returns this:

panic: eval js error: TypeError: Cannot read properties of null
(reading 'href')

However, I'm pretty sure the selector is correct. What did i do wrong?

Workaround

As @hymns for disco said in the comments, I just had to wait a while after the searcher finished.

el.MustInput("Lionel Messi")

time.Sleep(time.Second)

link := page.MustEval(`()=> document.querySelector('aSelector...').href`)

The above is the detailed content of How to access dynamic HTML elements via web scraping?. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:stackoverflow. If there is any infringement, please contact admin@php.cn delete

如何将HTML转换为MP4格式Feb 19, 2024 pm 02:48 PM

标题：HTML如何转换为MP4格式：详细代码示例在日常的网页制作过程中，我们常常会遇到将HTML页面或者特定的HTML元素转换为MP4视频的需求。例如将动画效果、幻灯片或其他动态元素保存为视频文件。本文将介绍如何使用HTML5和JavaScript将HTML转换为MP4格式，并提供具体的代码示例。HTML5的video标签和CanvasAPIHTML5引入

JS中appendChild与append区别Feb 20, 2024 pm 06:57 PM

JS中appendChild与append区别，需要具体代码示例在JavaScript中，当我们需要动态地向DOM（文档对象模型）中添加子元素时，我们通常使用appendChild和append这两个方法。虽然它们的目的都是为了向父元素中添加子元素，但在使用上却有一些区别。一、appendChild方法appendChild方法是DOM节点对象的方法之一，用

我们如何在所有HTML元素上嵌入自定义数据属性？Aug 28, 2023 pm 12:49 PM

在本文中，我们需要在所有HTML元素上嵌入自定义数据属性。我们可以使用HTML中的data-*属性来实现。在HTML中，data-*属性用于自定义仅对网页或应用程序私有的数据。该属性可为HTML元素添加自定义值。HTML中的data-*属性由两部分组成−属性值可以是任意字符串。属性名称应只包含小写字母，并且在前缀"data-"之后必须至少有一个字符。这些数据通常在JavaScript中用于改善用户体验。以下是在HTML元素上嵌入自定义数据属性的示例。示例1在这个例子中，我们已

JSP文件使用的技巧和注意事项Feb 01, 2024 am 09:15 AM

JSP文件的打开技巧与注意事项1.使用文本编辑器打开JSP文件JSP文件本质上是文本文件，因此可以使用任何文本编辑器来打开它们。一些流行的文本编辑器包括记事本、记事本++、SublimeText和Atom。2.在IDE中打开JSP文件如果你正在使用集成开发环境（IDE）来开发JSP应用程序，那么你也可以在IDE中打开JSP文件。一些流行的IDE包括Ec

HTML盒模型的概念及作用Feb 18, 2024 pm 09:49 PM

HTML盒模型是一种用于描述元素在网页中布局和定位的概念。它将每个HTML元素包装在一个矩形的盒子中，这个盒子由内容区域、内边距、边框和外边距组成。在编写网页时，了解盒模型对于控制元素的尺寸、位置和样式都非常重要。具体的盒模型示例可以通过以下代码进行演示：

CSS中相对单位和绝对单位有何异同？Feb 18, 2024 pm 10:07 PM

CSS（层叠样式表）是一种用于描述网页上元素样式的标记语言。在CSS中，有两种不同的长度单位，分别是相对单位和绝对单位。相对单位是相对于元素自身或其父元素的大小来计算的。常见的相对单位有：百分比（%）、em和rem。百分比单位是相对于父元素的大小来计算的。例如，如果父元素的宽度为400px，子元素的宽度设置为50%，那么子元素的实际宽度就是200px（400

HTML全局属性的实际运用场景：5个提升网页开发效率的技巧Feb 18, 2024 pm 05:35 PM

HTML全局属性的实际应用案例：提升网页开发效率的5个技巧HTML作为构建网页结构的标记语言，拥有许多全局属性，它们可以被应用在不同的元素上，用于实现不同的功能和效果。在网页开发过程中，合理地使用这些全局属性可以极大地提高开发效率。本文将为您介绍5个实际应用案例，并附上相应的代码示例。class属性的应用：批量修改样式class属性可以给HTML元素指定

深入学习响应式布局框架：适合初学者到专家的详尽指南Feb 19, 2024 pm 05:43 PM

响应式布局框架解析：从初学者到专家的必备指南随着移动设备的普及和多样化，响应式布局成为了现代Web设计的必备技能。响应式布局框架以其简单、灵活和可维护的特点，成为了开发者们的首选工具。然而，对于初学者来说，学习和理解响应式布局框架可能会感到有些困惑。本文将从初学者到专家，为您提供一个详细的指南，帮助您掌握响应式布局框架，同时提供具体的代码示例。什么是响应式布

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

How Long Does It Take To Beat Split Fiction?

1 months agoByDDD

R.E.P.O. Best Graphic Settings

2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows: Seashell Riddle Solution

1 weeks agoByDDD

R.E.P.O. How to Fix Audio if You Can't Hear Anyone

2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

PhpStorm Mac version

The latest (2018.2.1) professional PHP integrated development tool

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

Hot Topics

Where is the login entrance for gmail email?

7405

1630

1358

1268

1218