To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??D??????????D????????? 0011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110011111101000100001111110011111100111111001111110011111100111111001111110011111100111111 3f3f443f3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f
SJIS-WIN ??D???????壙普?D???????壙六 001111110011111101000100001111110011111100111111001111110011111100111111001111111001101011011011100101011000000100111111010001000011111100111111001111110011111100111111001111110011111110011010110110111001100001011010 3f3f443f3f3f3f3f3f3f9adb95813f443f3f3f3f3f3f3f9adb985a
EUC-JP ??D???????壙普?D???????壙六 001111110011111101000100001111110011111100111111001111110011111100111111001111111101010011011101110010011110000100111111010001000011111100111111001111110011111100111111001111110011111111010100110111011100111110111011 3f3f443f3f3f3f3f3f3fd4ddc9e13f443f3f3f3f3f3f3fd4ddcfbb
UTF-8 렻렮D렺셍렒렻렮렻팍壙普렮D렺셍렒렻렮렻팍壙六 1110101110100000101110111110101110100000101011100100010011101011101000001011101011101100100001011000110111101011101000001001001011101011101000001011101111101011101000001010111011101011101000001011101111101101100011001000110111100101101000111001100111100110100110011010111011101011101000001010111001000100111010111010000010111010111011001000010110001101111010111010000010010010111010111010000010111011111010111010000010101110111010111010000010111011111011011000110010001101111001011010001110011001111001011000010110101101 eba0bbeba0ae44eba0baec858deba092eba0bbeba0aeeba0bbed8c8de5a399e699aeeba0ae44eba0baec858deba092eba0bbeba0aeeba0bbed8c8de5a399e585ad
UHC 렻렮D렺셍렒렻렮렻팍壙普렮D렺셍렒렻렮렻팍壙六 1000111011000011100011101011101101000100100011101100001010111100110001001000111010100111100011101100001110001110101110111000111011000011110001101100010111001110110001011101110011000101100011101011101101000100100011101100001010111100110001001000111010100111100011101100001110001110101110111000111011000011110001101100010111001110110001011101011110111111 8ec38ebb448ec2bcc48ea78ec38ebb8ec3c6c5cec5dcc58ebb448ec2bcc48ea78ec38ebb8ec3c6c5cec5d7bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)