To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ????ぃ????ぃ[????ぃ????ぃ[^ 001111110011111100111111001111111000001010100001001111110011111100111111001111111000001010100001010110110011111100111111001111110011111110000010101000010011111100111111001111110011111110000010101000010101101101011110 3f3f3f3f82a13f3f3f3f82a15b3f3f3f3f82a13f3f3f3f82a15b5e
EUC-JP ????ぃ????ぃ[????ぃ????ぃ[^ 001111110011111100111111001111111010010010100011001111110011111100111111001111111010010010100011010110110011111100111111001111110011111110100100101000110011111100111111001111110011111110100100101000110101101101011110 3f3f3f3fa4a33f3f3f3fa4a35b3f3f3f3fa4a33f3f3f3fa4a35b5e
UTF-8 룶깰룶쥚ぃ룶깰룶쥚ぃ[룶깰룶쥚ぃ룶깰룶쥚ぃ[^ 111010111010001110110110111010101011100110110000111010111010001110110110111011001010010110011010111000111000000110000011111010111010001110110110111010101011100110110000111010111010001110110110111011001010010110011010111000111000000110000011010110111110101110100011101101101110101010111001101100001110101110100011101101101110110010100101100110101110001110000001100000111110101110100011101101101110101010111001101100001110101110100011101101101110110010100101100110101110001110000001100000110101101101011110 eba3b6eab9b0eba3b6eca59ae38183eba3b6eab9b0eba3b6eca59ae381835beba3b6eab9b0eba3b6eca59ae38183eba3b6eab9b0eba3b6eca59ae381835b5e
UHC 룶깰룶쥚ぃ룶깰룶쥚ぃ[룶깰룶쥚ぃ룶깰룶쥚ぃ[^ 10001111101010111011000111111101100011111010101110100010100011111010101010100011100011111010101110110001111111011000111110101011101000101000111110101010101000110101101110001111101010111011000111111101100011111010101110100010100011111010101010100011100011111010101110110001111111011000111110101011101000101000111110101010101000110101101101011110 8fabb1fd8faba28faaa38fabb1fd8faba28faaa35b8fabb1fd8faba28faaa38fabb1fd8faba28faaa35b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)