Go Colly如何找到请求的元素？-Golang-PHP中文网

首页

后端开发

Golang

Go Colly如何找到请求的元素？

PHPz

Feb 13, 2024 pm 01:57 PM

go语言

Go Colly如何找到请求的元素？

php小编香蕉将为大家介绍一款强大的网络爬虫框架——Go Colly。Go Colly是基于Go语言开发的一款轻量级网络爬虫框架，它具有高性能、高并发、易扩展等特点。在使用Go Colly进行网络爬取时，我们常常需要根据自己的需求找到请求的元素。那么，Go Colly如何找到请求的元素呢？接下来，我们将一一为大家解答。

问题内容

我正在尝试使用 colly 让特定的表循环遍历其内容，但该表未被识别，这是我到目前为止所拥有的。

package main

import (
    "fmt"
    
    "github.com/gocolly/colly"
)

func main() {
    c := colly.NewCollector(
        colly.AllowedDomains("wikipedia.org", "en.wikipedia.org"),
    )
    
    links := make([]string, 0)

    c.OnHTML("div.mw-parser-output", func(e *colly.HTMLElement) {
        
        e.ForEach("table.wikitable.sortable.jquery-tablesorter > tbody > tr", func(_ int, elem *colly.HTMLElement) {
            fmt.Println(elem.ChildAttr("a[href]", "href"))
            links = append(links, elem.ChildAttr("a[href]", "href"))
        })
    })
    
    c.OnRequest(func(r *colly.Request) {
        fmt.Println("Visiting", r.URL.String())
    })

    c.Visit("https://en.wikipedia.org/wiki/List_of_countries_and_dependencies_by_population")
    fmt.Println("Found urls for", len(links), "countries.")
}

我需要循环思考表中的所有 tr 元素。

解决方法

事实证明，类的名称实际上是 wikitable.sortable，即使在 chrome 控制台中显示为 wikitable sortable jquery-tablesorter。我不知道为什么名称如此不同，但它解决了我的问题。

以上是Go Colly如何找到请求的元素？的详细内容。更多信息请关注PHP中文网其他相关文章！

声明

本文转载于：stackoverflow。如有侵权，请联系admin@php.cn删除

与GO接口键入断言和类型开关May 02, 2025 am 12:20 AM

Gohandlesinterfacesandtypeassertionseffectively,enhancingcodeflexibilityandrobustness.1)Typeassertionsallowruntimetypechecking,asseenwiththeShapeinterfaceandCircletype.2)Typeswitcheshandlemultipletypesefficiently,usefulforvariousshapesimplementingthe

使用errors.is和错误。May 02, 2025 am 12:11 AM

Go语言的错误处理通过errors.Is和errors.As函数变得更加灵活和可读。1.errors.Is用于检查错误是否与指定错误相同，适用于错误链的处理。2.errors.As不仅能检查错误类型，还能将错误转换为具体类型，方便提取错误信息。使用这些函数可以简化错误处理逻辑，但需注意错误链的正确传递和避免过度依赖以防代码复杂化。

在GO中进行性能调整：优化您的应用程序May 02, 2025 am 12:06 AM

tomakegoapplicationsRunfasterandMorefly，useProflingTools，leverageConCurrency，andManageMoryfectily.1）usepprofforcpuorforcpuandmemoryproflingtoidentifybottlenecks.2）upitizegorizegoroutizegoroutinesandchannelstoparalletaparelalyizetasksandimproverperformance.3）

GO的未来：趋势和发展May 02, 2025 am 12:01 AM

go'sfutureisbrightwithtrendslikeMprikeMprikeTooling，仿制药，云 - 纳蒂维德象，performanceEnhancements，andwebassemblyIntegration，butchallengeSinclainSinClainSinClainSiNgeNingsImpliCityInsImplicityAndimimprovingingRornhandRornrorlling。

了解Goroutines：深入研究GO的并发May 01, 2025 am 12:18 AM

goroutinesarefunctionsormethodsthatruncurranceingo，启用效率和灯威量。1）shememanagedbodo'sruntimemultimusingmultiplexing，允许千sstorunonfewerosthreads.2）goroutinessimproverentimensImproutinesImproutinesImproveranceThroutinesImproveranceThrountinesimproveranceThroundinesImproveranceThroughEasySytaskParallowalizationAndeff

了解GO中的初始功能：目的和用法May 01, 2025 am 12:16 AM

purposeoftheInitfunctionoIsistoInitializeVariables，setUpConfigurations，orperformneccesSetarySetupBeforEtheMainFunctionExeCutes.useInitby.UseInitby：1）placingitinyourcodetorunautoamenationally oneraty oneraty oneraty on inity in ofideShortAndAndAndAndForemain，2）keepitiTshortAntAndFocusedonSimImimpletasks，3）

了解GO界面：综合指南May 01, 2025 am 12:13 AM

Gointerfacesaremethodsignaturesetsthattypesmustimplement,enablingpolymorphismwithoutinheritanceforcleaner,modularcode.Theyareimplicitlysatisfied,usefulforflexibleAPIsanddecoupling,butrequirecarefulusetoavoidruntimeerrorsandmaintaintypesafety.