Home >Backend Development >Golang >How to Efficiently Bulk Insert CSV Data into PostgreSQL Using Go Without For Loops?

How to Efficiently Bulk Insert CSV Data into PostgreSQL Using Go Without For Loops?

Mary-Kate Olsen
Mary-Kate OlsenOriginal
2024-10-29 06:52:31313browse

How to Efficiently Bulk Insert CSV Data into PostgreSQL Using Go Without For Loops?

Efficient Bulk Data Insertion from CSV to PostgreSQL using Go (Without For Loops)

In Go, the Postgres COPY command provides an efficient way to bulk insert data from a CSV file into a PostgreSQL database. Here's how to achieve this without using any for loops:

Step 1: Establish Database Connection

Establish a database connection using the pgx library. This is essential for accessing the database and executing commands.

Step 2: Open the CSV File

Open the CSV file containing the data to be inserted. Make sure the file is accessible by your application.

Step 3: Execute COPY Command

Execute the PgConn().CopyFrom() method to initiate the data transfer from the CSV file to the database table. Here's an example:

<code class="go">import (
    "context"
    "fmt"
    "io"

    "github.com/jackc/pgx/v4/pgxpool"
)

const query = "COPY csv_test FROM STDIN (FORMAT csv)"

func main() {
    dbpool, err := pgxpool.Connect(context.Background(), os.Getenv("DATABASE_URL"))
    if err != nil {
        panic(err)
    }
    defer dbpool.Close()

    f, err := os.Open("foo.csv")
    if err != nil {
        panic(err)
    }
    defer f.Close()

    res, err := dbpool.Conn().PgConn().CopyFrom(context.Background(), f, query)
    if err != nil {
        panic(err)
    }
    fmt.Print(res.RowsAffected())
}</code>

Breakdown of the Code:

  • The query variable defines the COPY command that will be executed.
  • The main() function establishes a connection to the database using dbpool, opens the CSV file, and executes the extended COPY command using PgConn().CopyFrom().
  • The res variable stores the result of the execution, including the number of affected rows.

Benefits:

  • Efficiency: The COPY command is highly efficient for bulk data insertion, avoiding unnecessary overhead compared to loop-based approaches.
  • Concurrency: pgxpool allows for concurrent connections, enabling parallel operations for faster data processing.
  • Type Safety: pgx ensures type safety when executing the COPY command, ensuring data integrity.

The above is the detailed content of How to Efficiently Bulk Insert CSV Data into PostgreSQL Using Go Without For Loops?. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn