To which bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | ??)逾??猶 | 00111111001111111000000101101010111001111010010100111111001111111001011101010000 | 3f3f816ae7a53f3f9750 |
| EUC-JP | ??)逾??猶 | 00111111001111111010000111001011111011101010011100111111001111111100110110110001 | 3f3fa1cbeea73f3fcdb1 |
| UTF-8 | 念잙)逾룟땔猶 | 111011111010011010100011111011001001111010011001111011111011110010001001111010011000000010111110111010111010001110011111111010111001010110010100111001111000110010110110 | efa6a3ec9e99efbc89e980beeba39feb9594e78cb6 |
| UHC | 念잙)逾룟땔猶 | 1110011011110110100111111110101110100011101010011110101110110101101101111110010110110110101010101110101110100010 | e6f69feba3a9ebb5b7e5b6aaeba2 |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
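
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the function name `encode_rows`, the `CHARSETS` mapping, and the choice of Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is the same pattern visible in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (charset, character, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip.
        shown = data.decode(codec, errors="replace")
        # Eight binary digits per encoded byte.
        bits = "".join(format(byte, "08b") for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_rows("猶"):
        print(*row)
```

For the single character 猶, this yields `9750` for SJIS-WIN, `cdb1` for EUC-JP, and `e78cb6` for UTF-8, matching the final bytes of the corresponding rows in the table.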