To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ???肛???? 001111110011111100111111111000111110100000111111001111110011111100111111 3f3f3fe3e83f3f3f3f
EUC-JP ???肛???? 001111110011111100111111111001101110101000111111001111110011111100111111 3f3f3fe6ea3f3f3f3f
UTF-8 了묕슭肛됵슬樂첕 111011111010011010111010111010111010110010010101111011001000101010101101111010001000001010011011111010111001000010110101111011001000101010101100111011111010011010111111111011001011001010010101 efa6baebac95ec8aade8829beb90b5ec8aacefa6bfecb295
UHC 了묕슭肛됵슬樂첕 11101000111001111001000111101111101111011011111011111001111111011000100111101111101111011011110111101000111110011010101101000010 e8e791efbdbef9fd89efbdbde8f9ab42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)