To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????E 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 蒻??宥?蒻??宥?E 111001001110100000111111001111111001011101000111001111111110010011101000001111110011111110010111010001110011111101000101 e4e83f3f97473fe4e83f3f97473f45
EUC-JP 蒻??宥?蒻??宥?E 111010001110101000111111001111111100110110101000001111111110100011101010001111110011111111001101101010000011111101000101 e8ea3f3fcda83fe8ea3f3fcda83f45
UTF-8 蒻몃쪉宥뻒蒻몃쪉宥뻖E 11101000100100101011101111101011101010101000001111101100101010101000100111100101101011101010010111101011101110111001001011101000100100101011101111101011101010101000001111101100101010101000100111100101101011101010010111101011101110111001011001000101 e892bbebaa83ecaa89e5aea5ebbb92e892bbebaa83ecaa89e5aea5ebbb9645
UHC 蒻몃쪉宥뻒蒻몃쪉宥뻖E 111001011011011010111000111010111010010110000011111010101110100110010110010110011110010110110110101110001110101110100101100000111110101011101001100101100110001001000101 e5b6b8eba583eae99659e5b6b8eba583eae9966245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)