⚝
One Hat Cyber Team
⚝
Your IP:
216.73.216.30
Server IP:
45.79.8.107
Server:
Linux localhost 5.15.0-140-generic #150-Ubuntu SMP Sat Apr 12 06:00:09 UTC 2025 x86_64
Server Software:
nginx/1.18.0
PHP Version:
8.1.2-1ubuntu2.21
Buat File
|
Buat Folder
Eksekusi
Dir :
~
/
lib
/
python3
/
dist-packages
/
chardet
/
__pycache__
/
View File Name :
charsetprober.cpython-310.pyc
o -åÏ_ö ã @ s0 d dl Z d dlZddlmZ G dd„ deƒZdS )é Né )ÚProbingStatec @ sn e Zd ZdZddd„Zdd„ Zedd„ ƒZd d „ Zedd„ ƒZ d d„ Z edd„ ƒZedd„ ƒZ edd„ ƒZdS )Ú CharSetProbergffffffî?Nc C s d | _ || _t t¡| _d S ©N)Ú_stateÚlang_filterÚloggingZ getLoggerÚ__name__Úlogger)Úselfr © r ú7/usr/lib/python3/dist-packages/chardet/charsetprober.pyÚ__init__' s zCharSetProber.__init__c C s t j| _d S r )r Z DETECTINGr ©r r r r Úreset, s zCharSetProber.resetc C ó d S r r r r r r Úcharset_name/ s zCharSetProber.charset_namec C r r r )r Úbufr r r Úfeed3 ó zCharSetProber.feedc C s | j S r )r r r r r Ústate6 s zCharSetProber.statec C s dS )Ng r r r r r Úget_confidence: r zCharSetProber.get_confidencec C s t dd| ¡} | S )Ns ([ -])+ó )ÚreÚsub)r r r r Úfilter_high_byte_only= s z#CharSetProber.filter_high_byte_onlyc C s\ t ƒ }t d| ¡}|D ] }| |dd… ¡ |dd… }| ¡ s&|dk r&d}| |¡ q|S )u9 We define three types of bytes: alphabet: english alphabets [a-zA-Z] international: international characters [€-ÿ] marker: everything else [^a-zA-Z€-ÿ] The input buffer can be thought to contain a series of words delimited by markers. This function works to filter all words that contain at least one international character. All contiguous sequences of markers are replaced by a single space ascii character. This filter applies to all scripts which do not use English characters. s% [a-zA-Z]*[€-ÿ]+[a-zA-Z]*[^a-zA-Z€-ÿ]?Néÿÿÿÿó €r )Ú bytearrayr ÚfindallÚextendÚisalpha)r ÚfilteredZwordsZwordZ last_charr r r Úfilter_international_wordsB s ÿz(CharSetProber.filter_international_wordsc C s¤ t ƒ }d}d}tt| ƒƒD ]7}| ||d … }|dkrd}n|dkr$d}|dk rD| ¡ sD||kr@|s@| | ||… ¡ | d¡ |d }q |sP| | |d … ¡ |S ) aÈ Returns a copy of ``buf`` that retains only the sequences of English alphabet and high byte characters that are not between <> characters. Also retains English alphabet and high byte characters immediately before occurrences of >. This filter can be applied to all scripts which contain both English characters and extended ASCII characters, but is currently only used by ``Latin1Prober``. Fr r ó >ó