To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 莉匁キ。讓ス貉 1110010010111011100101101110011010110111101000011110011010101000101111011110011010111001 e4bb96e6b7a1e6a8bde6b9
EUC-JP 莉匁キ。讓ス貉 1110100010111101110011001110100010001110101101111000111010100001111011001010101010001110101111011110110010111011 e8bdcce88eb78ea1ecaa8ebdecbb
UTF-8 莉匁キ。讓ス貉 111010001000111010001001111001011000110010000001111011111011110110110111111011111011110110100001111010001010111010010011111011111011110110111101111010001011001010001001 e88e89e58c81efbdb7efbda1e8ae93efbdbde8b289
UHC 莉???讓?? 110101111110100100111111001111110011111111100101110100110011111100111111 d7e93f3f3fe5d33f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)