To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^?????????^B 001111110011111100111111001111110011111100111111001111110011111100111111010111100011111100111111001111110011111100111111001111110011111100111111001111110101111001000010 3f3f3f3f3f3f3f3f3f5e3f3f3f3f3f3f3f3f3f5e42
SJIS-WIN ??濯?????〓^??濯?????〓^B 00111111001111111001000111110011001111110011111100111111001111110011111110000001101011000101111000111111001111111001000111110011001111110011111100111111001111110011111110000001101011000101111001000010 3f3f91f33f3f3f3f3f81ac5e3f3f91f33f3f3f3f3f81ac5e42
EUC-JP ??濯?????〓^??濯?????〓^B 00111111001111111100001011110101001111110011111100111111001111110011111110100010101011100101111000111111001111111100001011110101001111110011111100111111001111110011111110100010101011100101111001000010 3f3fc2f53f3f3f3f3fa2ae5e3f3fc2f53f3f3f3f3fa2ae5e42
UTF-8 룶점濯룵혧◐룶웡〓^룶점濯룵혧◐룶웡〓^B 111010111010001110110110111011001010000010010000111001101011111110101111111010111010001110110101111011011001100010100111111000101001011110010000111010111010001110110110111011001001101110100001111000111000000010010011010111101110101110100011101101101110110010100000100100001110011010111111101011111110101110100011101101011110110110011000101001111110001010010111100100001110101110100011101101101110110010011011101000011110001110000000100100110101111001000010 eba3b6eca090e6bfafeba3b5ed98a7e29790eba3b6ec9ba1e380935eeba3b6eca090e6bfafeba3b5ed98a7e29790eba3b6ec9ba1e380935e42
UHC 룶점濯룵혧◐룶웡〓^룶점濯룵혧◐룶웡〓^B 100011111010101111000001101000011111011011111011100011111010101011000010100011111010001011000100100011111010101110111111111111011010000111101011010111101000111110101011110000011010000111110110111110111000111110101010110000101000111110100010110001001000111110101011101111111111110110100001111010110101111001000010 8fabc1a1f6fb8faac28fa2c48fabbffda1eb5e8fabc1a1f6fb8faac28fa2c48fabbffda1eb5e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)