What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | 螟耶ｷ怜､耶ｷ与 | 11100101101001001001011011101011101101111001011111100101101001001001011011101011101101111001011101011110 | e5a496ebb797e5a496ebb7975e |
| EUC-JP | 螟耶ｷ怜､耶ｷ与 | 11101010101001101100110011101101100011101011011111001110111001111000111010100100110011001110110110001110101101111100110110111111 | eaa6cced8eb7cee78ea4cced8eb7cdbf |
| UTF-8 | 螟耶ｷ怜､耶ｷ与 | 111010001001111010011111111010001000000010110110111011111011110110110111111001101000000010011100111011111011110110100100111010001000000010110110111011111011110110110111111001001011100010001110 | e89e9fe880b6efbdb7e6809cefbda4e880b6efbdb7e4b88e |
| UHC | 螟耶?怜?耶?? | 110110011010110111100101101011010011111111010110101110110011111111100101101011010011111100111111 | d9ade5ad3fd6bb3fe5ad3f3f |

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
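
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?`, which is how the `3f` bytes in the table arise.

```python
# Assumed mapping from the charset labels used above to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset for a sample input.
    for label, (bits, hexstr) in convert("あ").items():
        print(label, bits, hexstr)
```

Legacy codec tables can differ slightly between implementations, so results for SJIS-WIN, EUC-JP, and UHC may not match another tool byte for byte on every character.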