Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN ??を??あ??やz??を??あ??やzB 001111110011111110000010111100000011111100111111100000101010000000111111001111111000001011100010011110100011111100111111100000101111000000111111001111111000001010100000001111110011111110000010111000100111101001000010 3f3f82f03f3f82a03f3f82e27a3f3f82f03f3f82a03f3f82e27a42
EUC-JP ??を??あ??やz??を??あ??やzB 001111110011111110100100111100100011111100111111101001001010001000111111001111111010010011100100011110100011111100111111101001001111001000111111001111111010010010100010001111110011111110100100111001000111101001000010 3f3fa4f23f3fa4a23f3fa4e47a3f3fa4f23f3fa4a23f3fa4e47a42
UTF-8 룶쥚を룶쥚あ룶쥚やz룶쥚を룶쥚あ룶쥚やzB 111010111010001110110110111011001010010110011010111000111000001010010010111010111010001110110110111011001010010110011010111000111000000110000010111010111010001110110110111011001010010110011010111000111000001010000100011110101110101110100011101101101110110010100101100110101110001110000010100100101110101110100011101101101110110010100101100110101110001110000001100000101110101110100011101101101110110010100101100110101110001110000010100001000111101001000010 eba3b6eca59ae38292eba3b6eca59ae38182eba3b6eca59ae382847aeba3b6eca59ae38292eba3b6eca59ae38182eba3b6eca59ae382847a42
UHC 룶쥚を룶쥚あ룶쥚やz룶쥚を룶쥚あ룶쥚やzB 100011111010101110100010100011111010101011110010100011111010101110100010100011111010101010100010100011111010101110100010100011111010101011100100011110101000111110101011101000101000111110101010111100101000111110101011101000101000111110101010101000101000111110101011101000101000111110101010111001000111101001000010 8faba28faaf28faba28faaa28faba28faae47a8faba28faaf28faba28faaa28faba28faae47a42
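The rows above can be approximated with a short Python sketch. It makes two assumptions that the page does not confirm: that Python's `cp932` and `cp949` codecs correspond to SJIS-WIN and UHC, and that characters with no mapping in the target charset are replaced by `?` (0x3f), which is what the ISO-8859-1, SJIS-WIN, and EUC-JP rows suggest. The names `CHARSETS` and `encode_row` are hypothetical and exist only for this illustration.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (rendered text, binary bit string, hex bit string) for one charset."""
    # Characters the charset cannot represent become "?" (byte 0x3f).
    data = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    rendered = data.decode(codec)
    # Eight bits per byte, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return rendered, binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        rendered, binary, hexadecimal = encode_row("あzB", codec)
        # Print only the ASCII columns to stay independent of the console encoding.
        print(label, binary, hexadecimal)
```

For example, `encode_row("あ", "utf-8")` yields the hexadecimal string `e38182`, which matches the bytes shown for あ in the UTF-8 row.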

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.