Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
406 views
in Technique[技术] by (71.8m points)

PHP中的mb_detect_encoding和mb_convert_encoding方法获取编码和转码问题

  1. 用mb_detect_encoding获得的返回值是cp936这个与GBK对应?

  2. 通过mb_convert_encoding转码后,虽然文本显示正常,但是在用mb_detect_encoding检测文本编码格式,仍然是cp936,并没有改变,这是为什么?

代码如下:

$file_contents = fread($file,$fileSize);

$typeofData = mb_detect_encoding($file_contents,array("GBK","GB2312","UTF-8","ASCII","BIG5"));

if ($typeofData != "UTF-8"){
//    $file_contents = iconv("GBK","UTF-8",$file_contents);
    $file_contents = mb_convert_encoding($file_contents,"UTF-8","GBK");
}

echo  mb_detect_encoding($file_contents,array("GBK","GB2312","UTF-8","ASCII","BIG5"))."<br/>";
echo $file_contents;

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)

GBK的Code Page是CP936.
我在Ubuntu上用PHP5和PHP7都试过了,转成UTF-8编码后能够检测到UTF-8:

<?php
$str = file_get_contents('/path/to/gbk.txt'); //GBK编码的文本文件
$order = array('GB2312', 'GBK', 'GB18030', 'UTF-8', 'ASCII', 'BIG5');
$encode = mb_detect_encoding($str, $order, true); //可见CP936(即GBK)
$str = mb_convert_encoding($str, 'UTF-8', $encode); //转成UTF-8
echo mb_detect_encoding($str, $order, true); //输出UTF-8

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...