What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ??ュ??ゲ?淋?[??ュ??ゲ?淋?[^ 001111110011111110000011100001010011111100111111100000110101000100111111100101111101001000111111010110110011111100111111100000111000010100111111001111111000001101010001001111111001011111010010001111110101101101011110 3f3f83853f3f83513f97d23f5b3f3f83853f3f83513f97d23f5b5e
EUC-JP ??ュ??ゲ?淋?[??ュ??ゲ?淋?[^ 001111110011111110100101111001010011111100111111101001011011001000111111110011101101010000111111010110110011111100111111101001011110010100111111001111111010010110110010001111111100111011010100001111110101101101011110 3f3fa5e53f3fa5b23fced43f5b3f3fa5e53f3fa5b23fced43f5b5e
UTF-8 룶퀛ュ룴쵍ゲ룵淋헉[룶퀛ュ룴쵍ゲ룵淋헉[^ 111010111010001110110110111011011000000010011011111000111000001110100101111010111010001110110100111011001011010110001101111000111000001010110010111010111010001110110101111001101011011110001011111011011001011110001001010110111110101110100011101101101110110110000000100110111110001110000011101001011110101110100011101101001110110010110101100011011110001110000010101100101110101110100011101101011110011010110111100010111110110110010111100010010101101101011110 eba3b6ed809be383a5eba3b4ecb58de382b2eba3b5e6b78bed97895beba3b6ed809be383a5eba3b4ecb58de382b2eba3b5e6b78bed97895b5e
UHC 룶퀛ュ룴쵍ゲ룵淋헉[룶퀛ュ룴쵍ゲ룵淋헉[^ 100011111010101110110011100011111010101111100101100011111010100110101100100011111010101110110010100011111010101011010111111110101100011111100100010110111000111110101011101100111000111110101011111001011000111110101001101011001000111110101011101100101000111110101010110101111111101011000111111001000101101101011110 8fabb38fabe58fa9ac8fabb28faad7fac7e45b8fabb38fabe58fa9ac8fabb28faad7fac7e45b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
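
The same kind of conversion can be roughly reproduced offline. The following is a minimal Python sketch, not the code behind this page. The codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumed to be the closest Python equivalents, and the function name encode_all is hypothetical. Characters that a charset cannot represent are replaced with "?" (3f), as in the table above.

```python
# Charset labels from the table, mapped to Python codec names.
# The mapping is an assumption: Python's codecs may differ from the
# converter's in a few vendor-specific characters.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_all("ュ[^").items():
        print(label, binary, hexadecimal)
```

For the input "ュ[^", this sketch yields 83855b5e for SJIS-WIN, a5e55b5e for EUC-JP, and e383a55b5e for UTF-8, which matches the byte sequences for "ュ", "[", and "^" in the table rows above.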