To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??泣??儒??閻????┼???五??竊 1001101001101010001111110011111110001011100000110011111100111111100011101111001000111111001111111110100010000101001111110011111100111111001111111000010010101001001111110011111100111111100011001101110000111111001111111110001010000110 9a6a3f3f8b833f3f8ef23f3fe8853f3f3f3f84a93f3f3f8cdc3f3fe286
EUC-JP 嗚??泣??儒??閻??彛?┼洧??五??竊 110100111100101100111111001111111011010111100011001111110011111110111100111101000011111100111111111011111110010100111111001111111000111110111100111110100011111110101000101010111000111111000111101101000011111100111111101110001101111000111111001111111110001111100110 d3cb3f3fb5e33f3fbcf43f3fefe53f3f8fbcfa3fa8ab8fc7b43f3fb8de3f3fe3e6
UTF-8 嗚삠굦泣쒏껸儒룸짎閻롫벊彛랃┼洧꿸데五묎막竊 111001011001011110011010111011001000001010100000111010101011010110100110111001101011001110100011111011001001001010001111111010101011101110111000111001011000010010010010111010111010001110111000111011001010011110001110111010011001011010111011111010111010000110101011111010111011001010001010111001011011110110011011111010111001111010000011111000101001010010111100111001101011010010100111111010101011111110111000111010111000110110110000111001001011101010010100111010111010110010001110111010111010011110001001111001111010101110001010 e5979aec82a0eab5a6e6b3a3ec928feabbb8e58492eba3b8eca78ee996bbeba1abebb28ae5bd9beb9e83e294bce6b4a7eabfb8eb8db0e4ba94ebac8eeba789e7ab8a
UHC 嗚삠굦泣쒏껸儒룸짎閻롫벊彛랃┼洧꿸데五묎막竊 1110011111110000101110111110001110000010100011001110101111101000100111001110011010110010101110011110101011100011101101111110101110100011100110101110011110100010100011101110101110010011101011011110110010101101100011011110111110100110101010111110101011111011101100101110101010110101101001011110011111101001100100011110101010111000101101111110111110111100 e7f0bbe3828cebe89ce6b2b9eae3b7eba39ae7a28eeb93adecad8defa6abeafbb2eab5a5e7e991eab8b7efbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)