To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?C?Cf?C?C^}Y?C?Cf?C?C^}bE 00111111010000110011111101000011011001100011111101000011001111110100001101011110011111010101100100111111010000110011111101000011011001100011111101000011001111110100001101011110011111010110001001000101 3f433f43663f433f435e7d593f433f43663f433f435e7d6245
SJIS-WIN 哀C哀Cf哀C哀C^}Y哀C哀Cf哀C哀C^}bE 100010001010001101000011100010001010001101000011011001101000100010100011010000111000100010100011010000110101111001111101010110011000100010100011010000111000100010100011010000110110011010001000101000110100001110001000101000110100001101011110011111010110001001000101 88a34388a3436688a34388a3435e7d5988a34388a3436688a34388a3435e7d6245
EUC-JP 哀C哀Cf哀C哀C^}Y哀C哀Cf哀C哀C^}bE 101100001010010101000011101100001010010101000011011001101011000010100101010000111011000010100101010000110101111001111101010110011011000010100101010000111011000010100101010000110110011010110000101001010100001110110000101001010100001101011110011111010110001001000101 b0a543b0a54366b0a543b0a5435e7d59b0a543b0a54366b0a543b0a5435e7d6245
UTF-8 哀C哀Cf哀C哀C^}Y哀C哀Cf哀C哀C^}bE 1110010110010011100000000100001111100101100100111000000001000011011001101110010110010011100000000100001111100101100100111000000001000011010111100111110101011001111001011001001110000000010000111110010110010011100000000100001101100110111001011001001110000000010000111110010110010011100000000100001101011110011111010110001001000101 e5938043e593804366e5938043e59380435e7d59e5938043e593804366e5938043e59380435e7d6245
UHC 哀C哀Cf哀C哀C^}Y哀C哀Cf哀C哀C^}bE 111001001110111001000011111001001110111001000011011001101110010011101110010000111110010011101110010000110101111001111101010110011110010011101110010000111110010011101110010000110110011011100100111011100100001111100100111011100100001101011110011111010110001001000101 e4ee43e4ee4366e4ee43e4ee435e7d59e4ee43e4ee4366e4ee43e4ee435e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)