To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????沃??嚥??蘊???↑た汝??B 00111111001111110011111100111111001111110011111110010111100000000011111100111111100110101000101100111111001111111110010101011101001111110011111100111111100000011010101010000010101111011001001111110000001111110011111101000010 3f3f3f3f3f3f97803f3f9a8b3f3fe55d3f3f3f81aa82bd93f03f3f42
EUC-JP ??????沃??嚥??蘊???↑た汝??B 00111111001111110011111100111111001111110011111111001101111000000011111100111111110100111110101100111111001111111110100110111110001111110011111100111111101000101010110010100100101111111100011011110010001111110011111101000010 3f3f3f3f3f3fcde03f3fd3eb3f3fe9be3f3f3fa2aca4bfc6f23f3f42
UTF-8 廬볢퀌溜곕젩沃욘짂嚥ㅶ젗蘊귣젾溜↑た汝믤엘B 11101111101001101000001011101011101100111010001011101101100000001000110011101111101001111000101111101010101100111001010111101100101000001010100111100110101100101000001111101100100110101001100011101100101001111000001011100101100110101010010111100011100001011011011011101100101000001001011111101000100110001000101011101010101101111010001111101100101000001011111011101111101001111000101111100010100001101001000111100011100000011001111111100110101100011001110111101011101011111010010011101100100101111001100001000010 efa682ebb3a2ed808cefa78beab395eca0a9e6b283ec9a98eca782e59aa5e385b6eca097e8988aeab7a3eca0beefa78be28691e3819fe6b19debafa4ec979842
UHC 廬볢퀌溜곕젩沃욘짂嚥ㅶ젗蘊귣젾溜↑た汝믤엘B 11100101111111101001001111101000101100111000001011101010111111101011000011101011101000001010000111101000101010101011111111100110101000111001001011100110101111111010010011100110101000001001001111101000101100111000001011101011101000001011000011101010111111101010000111101000101010101011111111100110101000111001001011100110101111111010010001000010 e5fe93e8b382eafeb0eba0a1e8aabfe6a392e6bfa4e6a093e8b382eba0b0eafea1e8aabfe6a392e6bfa442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)