To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???hf???h^}Y???hf???h^}bE 00111111001111110011111101101000011001100011111100111111001111110110100001011110011111010101100100111111001111110011111101101000011001100011111100111111001111110110100001011110011111010110001001000101 3f3f3f68663f3f3f685e7d593f3f3f68663f3f3f685e7d6245
SJIS-WIN 絶??hf絶??h^}Y絶??hf絶??h^}bE 1001000011100010001111110011111101101000011001101001000011100010001111110011111101101000010111100111110101011001100100001110001000111111001111110110100001100110100100001110001000111111001111110110100001011110011111010110001001000101 90e23f3f686690e23f3f685e7d5990e23f3f686690e23f3f685e7d6245
EUC-JP 絶??hf絶??h^}Y絶??hf絶??h^}bE 1100000011100100001111110011111101101000011001101100000011100100001111110011111101101000010111100111110101011001110000001110010000111111001111110110100001100110110000001110010000111111001111110110100001011110011111010110001001000101 c0e43f3f6866c0e43f3f685e7d59c0e43f3f6866c0e43f3f685e7d6245
UTF-8 絶랃쉼hf絶랃쉼h^}Y絶랃쉼hf絶랃쉼h^}bE 11100111101101011011011011101011100111101000001111101100100010011011110001101000011001101110011110110101101101101110101110011110100000111110110010001001101111000110100001011110011111010101100111100111101101011011011011101011100111101000001111101100100010011011110001101000011001101110011110110101101101101110101110011110100000111110110010001001101111000110100001011110011111010110001001000101 e7b5b6eb9e83ec89bc6866e7b5b6eb9e83ec89bc685e7d59e7b5b6eb9e83ec89bc6866e7b5b6eb9e83ec89bc685e7d6245
UHC 絶랃쉼hf絶랃쉼h^}Y絶랃쉼hf絶랃쉼h^}bE 11101111101111101000110111101111101111011011000001101000011001101110111110111110100011011110111110111101101100000110100001011110011111010101100111101111101111101000110111101111101111011011000001101000011001101110111110111110100011011110111110111101101100000110100001011110011111010110001001000101 efbe8defbdb06866efbe8defbdb0685e7d59efbe8defbdb06866efbe8defbdb0685e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)