To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN ??????瓮f?D??????瓮f?D^ 00111111001111110011111100111111001111110011111111100001010001001000001010000110001111110100010000111111001111110011111100111111001111110011111111100001010001001000001010000110001111110100010001011110 3f3f3f3f3f3fe14482863f443f3f3f3f3f3fe14482863f445e
EUC-JP ??????瓮f?D??????瓮f?D^ 00111111001111110011111100111111001111110011111111100001101001011010001111100110001111110100010000111111001111110011111100111111001111110011111111100001101001011010001111100110001111110100010001011110 3f3f3f3f3f3fe1a5a3e63f443f3f3f3f3f3fe1a5a3e63f445e
UTF-8 曆낁랜僚묉톭瓮f쪛D曆낁랜僚묉톭瓮f쪛D^ 111011111010011010001011111010111000001010000001111010111001111010011100111011111010011010111011111010111010110010001001111011011000011010101101111001111001001110101110111011111011110110000110111011001010101010011011010001001110111110100110100010111110101110000010100000011110101110011110100111001110111110100110101110111110101110101100100010011110110110000110101011011110011110010011101011101110111110111101100001101110110010101010100110110100010001011110 efa68beb8281eb9e9cefa6bbebac89ed86ade793aeefbd86ecaa9b44efa68beb8281eb9e9cefa6bbebac89ed86ade793aeefbd86ecaa9b445e
UHC 曆낁랜僚묉톭瓮f쪛D曆낁랜僚묉톭瓮f쪛D^ 111001101011011110000101111010001011011110100011111010001110100010010001111001101011011110000101111010001011011110100011111001101010010110010100010001001110011010110111100001011110100010110111101000111110100011101000100100011110011010110111100001011110100010110111101000111110011010100101100101000100010001011110 e6b785e8b7a3e8e891e6b785e8b7a3e6a59444e6b785e8b7a3e8e891e6b785e8b7a3e6a594445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)