To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?油??碎??壓??依?? 111001001110100010000010111010100011111110010110111110110011111100111111111000011110101000111111001111111001101011011000001111110011111110001000110010110011111100111111 e4e882ea3f96fb3f3fe1ea3f3f9ad83f3f88cb3f3f
EUC-JP 蒻れ?油??碎??壓??依?? 111010001110101010100100111011000011111111001100111111010011111100111111111000101110110000111111001111111101010011011010001111110011111110110000110011010011111100111111 e8eaa4ec3fccfd3f3fe2ec3f3fd4da3f3fb0cd3f3f
UTF-8 蒻れ슜油녷룚碎듬펳壓믩툖依긷땔 111010001001001010111011111000111000001010001100111011001000101010011100111001101011001010111001111010111000010110110111111010111010001110011010111001111010001010001110111010111001001110101100111011011000111010110011111001011010001110010011111010111010111110101001111011011000100010010110111001001011111010011101111010101011100010110111111010111001010110010100 e892bbe3828cec8a9ce6b2b9eb85b7eba39ae7a28eeb93aced8eb3e5a393ebafa9ed8896e4be9deab8b7eb9594
UHC 蒻れ슜油녷룚碎듬펳壓믩툖依긷땔 111001011011011010101010111011001001101010101001111010101111101010000110111001101000111110010110111000011110111110110101111010111011110010000101111001001110001010010010111010111011100010001101111010111110111010110001111001011011011010101010 e5b6aaec9aa9eafa86e68f96e1efb5ebbc85e4e292ebb88debeeb1e5b6aa

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
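
The same kind of conversion can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_bits` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the table above.

```python
def encode_bits(text, charset):
    """Encode text in the given charset; return (binary string, hex string).

    Characters the charset cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(charset, errors="replace")
    # Each byte is rendered as 8 binary digits, most significant bit first.
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    bits, hex_string = encode_bits("蒻れ", codec)
    print(label, bits, hex_string)
```

For the two-character input `蒻れ`, this sketch yields `e892bbe3828c` for UTF-8, `e4e882ea` for SJIS-WIN, and `e8eaa4ec` for EUC-JP, matching the leading bytes of the corresponding rows in the table.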