To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????D??????????D^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000100001111110011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f3f445e
SJIS-WIN ?依????彧???D?依????彧???D^ 001111111000100011001011001111110011111100111111001111111111101010111001001111110011111100111111010001000011111110001000110010110011111100111111001111110011111111111010101110010011111100111111001111110100010001011110 3f88cb3f3f3f3ffab93f3f3f443f88cb3f3f3f3ffab93f3f3f445e
EUC-JP ?依????彧???D?依????彧???D^ 0011111110110000110011010011111100111111001111110011111110001111101111001111111000111111001111110011111101000100001111111011000011001101001111110011111100111111001111111000111110111100111111100011111100111111001111110100010001011110 3fb0cd3f3f3f3f8fbcfe3f3f3f443fb0cd3f3f3f3f8fbcfe3f3f3f445e
UTF-8 렱依렲센샹렱彧렶센셋D렱依렲센샹렱彧렶센셋D^ 111010111010000010110001111001001011111010011101111010111010000010110010111011001000010010111100111011001000001110111001111010111010000010110001111001011011110110100111111010111010000010110110111011001000010010111100111011001000010110001011010001001110101110100000101100011110010010111110100111011110101110100000101100101110110010000100101111001110110010000011101110011110101110100000101100011110010110111101101001111110101110100000101101101110110010000100101111001110110010000101100010110100010001011110 eba0b1e4be9deba0b2ec84bcec83b9eba0b1e5bda7eba0b6ec84bcec858b44eba0b1e4be9deba0b2ec84bcec83b9eba0b1e5bda7eba0b6ec84bcec858b445e
UHC 렱依렲센샹렱彧렶센셋D렱依렲센샹렱彧렶센셋D^ 10001110101111101110101111101110100011101011111110111100101111101011110010100111100011101011111011101001111011101000111011000001101111001011111010111100110000100100010010001110101111101110101111101110100011101011111110111100101111101011110010100111100011101011111011101001111011101000111011000001101111001011111010111100110000100100010001011110 8ebeebee8ebfbcbebca78ebee9ee8ec1bcbebcc2448ebeebee8ebfbcbebca78ebee9ee8ec1bcbebcc2445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)