To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?C??nf?C??n^}Y?C??nf?C??n^}bE 0011111101000011001111110011111101101110011001100011111101000011001111110011111101101110010111100111110101011001001111110100001100111111001111110110111001100110001111110100001100111111001111110110111001011110011111010110001001000101 3f433f3f6e663f433f3f6e5e7d593f433f3f6e663f433f3f6e5e7d6245
SJIS-WIN 短C賊他nf短C賊他n^}Y短C賊他nf短C賊他n^}bE 1001001001011010010000111001000110101111100100011011110001101110011001101001001001011010010000111001000110101111100100011011110001101110010111100111110101011001100100100101101001000011100100011010111110010001101111000110111001100110100100100101101001000011100100011010111110010001101111000110111001011110011111010110001001000101 925a4391af91bc6e66925a4391af91bc6e5e7d59925a4391af91bc6e66925a4391af91bc6e5e7d6245
EUC-JP 短C賊他nf短C賊他n^}Y短C賊他nf短C賊他n^}bE 1100001110111011010000111100001010110001110000101011111001101110011001101100001110111011010000111100001010110001110000101011111001101110010111100111110101011001110000111011101101000011110000101011000111000010101111100110111001100110110000111011101101000011110000101011000111000010101111100110111001011110011111010110001001000101 c3bb43c2b1c2be6e66c3bb43c2b1c2be6e5e7d59c3bb43c2b1c2be6e66c3bb43c2b1c2be6e5e7d6245
UTF-8 短C賊他nf短C賊他n^}Y短C賊他nf短C賊他n^}bE 1110011110011111101011010100001111101000101100111000101011100100101110111001011001101110011001101110011110011111101011010100001111101000101100111000101011100100101110111001011001101110010111100111110101011001111001111001111110101101010000111110100010110011100010101110010010111011100101100110111001100110111001111001111110101101010000111110100010110011100010101110010010111011100101100110111001011110011111010110001001000101 e79fad43e8b38ae4bb966e66e79fad43e8b38ae4bb966e5e7d59e79fad43e8b38ae4bb966e66e79fad43e8b38ae4bb966e5e7d6245
UHC 短C賊他nf短C賊他n^}Y短C賊他nf短C賊他n^}bE 1101001110101101010000111110111011100100111101101110001001101110011001101101001110101101010000111110111011100100111101101110001001101110010111100111110101011001110100111010110101000011111011101110010011110110111000100110111001100110110100111010110101000011111011101110010011110110111000100110111001011110011111010110001001000101 d3ad43eee4f6e26e66d3ad43eee4f6e26e5e7d59d3ad43eee4f6e26e66d3ad43eee4f6e26e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)