What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ??序ぴ??ぴ⊂ 001111110011111110001111100110001000001011010010001111110011111110000010110100101000000110111100 3f3f8f9882d23f3f82d281bc
EUC-JP ??序ぴ??ぴ⊂ 001111110011111110111101111110001010010011010100001111110011111110100100110101001010001010111110 3f3fbdf8a4d43f3fa4d4a2be
UTF-8 룶깹序ぴ룶혧ぴ⊂ 111010111010001110110110111010101011100110111001111001011011101010001111111000111000000110110100111010111010001110110110111011011001100010100111111000111000000110110100111000101000101010000010 eba3b6eab9b9e5ba8fe381b4eba3b6ed98a7e381b4e28a82
UHC 룶깹序ぴ룶혧ぴ⊂ 10001111101010111011001010100001110111111110110110101010110101001000111110101011110000101000111110101010110101001010000111111000 8fabb2a1dfedaad48fabc28faad4a1f8

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
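
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the converter's own implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (byte 3f), which is consistent with the `?` entries in the table.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the converter itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, binary bit string, hexadecimal bit string.
    for label, (bits, hex_digits) in encode_table("序ぴ⊂").items():
        print(label, bits, hex_digits)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.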