/server_test /reports/software_asserts.txt"."/> /server_test /reports/software_asserts.txt".">

Home  >  Article  >  Operation and Maintenance  >  What to do if linux socket is garbled

What to do if linux socket is garbled

藏色散人
藏色散人Original
2023-03-16 09:26:301814browse

Solution to Linux socket garbled code: 1. Open the terminal command window; 2. Use the iconv command to convert the Linux file character set. The conversion command is such as "iconv -f utf-8 -t gb2312 /server_test/reports/ software_.txt > /server_test/reports/software_asserts.txt".

What to do if linux socket is garbled

#The operating environment of this tutorial: linux5.9.8 system, Dell G3 computer.

What should I do if the Linux socket is garbled?

linux, windows, socket garbled problem

Problem description:

There is a ftp client written by myself in linux, and there is a built-in ftp client in windows. The ftp server built by the software, when I create a directory containing Chinese characters or a file name containing Chinese characters on the server through the program, the Chinese name is garbled

Problem analysis:

There is no doubt that It is caused by an encoding problem, so how to confirm which encoding it should be? Baidu later discovered that ubuntu uses utf-8 by default, while windows uses gbk by default, so just make sure that content containing Chinese paths is converted to gbk when it is sent out

Solution:

Conversion between utf8 and gbk:

Linux file character set conversion (utf8-gb2312)

1, command line

When performing encoding conversion on LINUX, you can use the iconv command to achieve this. This is for files, that is, converting the specified file from one encoding to another encoding.

iconv command usage is as follows:

iconv [选项...] [文件...]
  • Input/output format specification:

  -f, --from-code=名称 原始文本编码
  -t, --to-code=名称 输出编码
  • List All known character sets -l, --list

  • Output control:

  -c   从输出中忽略无效的字符
  -o, --output=FILE   输出文件
  -s, --silent   关闭警告
  --verbose   打印进度信息

Example:

iconv -f utf-8 -t gb2312 /server_test/reports/software_.txt > /server_test/reports/software_asserts.txt

Please download the libiconv library on the embedded board for the following required library files

2, function application

1.Header file

#include <iconv.h>

2. Function

(1) iconv_t iconv_open(const char *tocode, const char *fromcode);

This function explains which two encodings will be converted, tocode is the target encoding, fromcode is the original encoding, and this function returns a conversion handle for use by the following two functions.

(2) size_t iconv(iconv_t cd,char **inbuf,size_t *inbytesleft,char **outbuf,size_t *outbytesleft);

This function reads characters from inbuf and converts them Then output to outbuf, inbytesleft is used to record the number of characters that have not been converted, and outbytesleft is used to record the remaining space of the output buffer.

(3) int iconv_close(iconv_t cd);

This function is used to close the conversion handle and release resources.

3. Example: Conversion sample program implemented in C language

#include "stdio.h"
#include "string.h"
#include "iconv.h"
#include "stdlib.h"
 
static  int  charset_convert( const  char  *from_charset,  const  char  *to_charset,
                            char  *in_buf,  size_t  in_left,  char  *out_buf,  size_t  out_left)
{
     iconv_t icd = (iconv_t)-1;
     size_t  sRet = -1;
     char  *pIn = in_buf;
     char  *pOut = out_buf;
     size_t  outLen = out_left;
 
     if  (NULL == from_charset || NULL == to_charset || NULL == in_buf || 0 >= in_left || NULL == out_buf || 0 >= out_left)
     {
         return  -1;
     }
 
     icd = iconv_open(to_charset, from_charset);
     if  ((iconv_t)-1 == icd)
     {
         return  -1;
     }
 
     sRet = iconv(icd, &pIn, &in_left, &pOut, &out_left);
     if  (( size_t )-1 == sRet)
     {
         iconv_close(icd);
         return  -1;
     }
 
     out_buf[outLen - out_left] = 0;
     iconv_close(icd);
     return  ( int )(outLen - out_left);
}
 
static  int  charset_convert_UTF8_TO_GB2312( char  *in_buf,  size_t  in_left,  char  *out_buf,  size_t  out_left)
{
     return  charset_convert( "UTF-8" ,  "GB2312" , in_buf, in_left, out_buf, out_left);
}
 
static  int  charset_convert_GB2312_TO_UTF8( char  *in_buf,  size_t  in_left,  char  *out_buf,  size_t  out_left)
{
     return  charset_convert( "GB2312-8" ,  "UTF-8" , in_buf, in_left, out_buf, out_left);
}
 
int  main()
{
     char  *pIn =  "hello 中国人" ;
     char  *pOut = NULL;
     size_t  outLen = 0;
     size_t  inLen = 0;
     int  iRet = -1;
 
     inLen =  strlen (pIn);
     outLen = inLen * 3;
     pOut = ( char  *) malloc ( sizeof ( char ) * outLen);
     if  (NULL == pOut)
     {
         return  -1;
     }
     memset (pOut, 0,  sizeof ( char ) * outLen);
 
     iRet = charset_convert_UTF8_TO_GB2312(pIn, ( size_t )inLen, pOut, ( size_t )inLen);
     if  (-1 == iRet)
     {
         return  -1;
     }
 
     printf ( "out = %s;\touLen = %d\n" , pOut, iRet);
     free (pOut);
     pOut = NULL;
     return  0;
}

This implementation can be cross-platform, and I personally think it is better~

ps: The ftp client I made before is On Windows, because Party A said it wanted to transplant it, it switched to Windows, and then the above problem occurred. Although I compared the Chinese solutions that appeared in the project, I at least accumulated experience in solving the problem (no Chinese, You can use Pinyin)

Related recommendations: "Linux Video Tutorial"

The above is the detailed content of What to do if linux socket is garbled. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn