To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???與??厭?? 0011111100111111001111111110010001101111001111110011111110001001011111010011111100111111 3f3f3fe46f3f3f897d3f3f
EUC-JP ???與??厭?? 0011111100111111001111111110011111010000001111110011111110110001110111100011111100111111 3f3f3fe7d03f3fb1de3f3f
UTF-8 歷잙젺與쀫젗厭묐뜶 111011111010011010001100111011001001111010011001111011001010000010111010111010001000100010000111111011001000000010101011111011001010000010010111111001011000111010101101111010111010110010010000111010111001110010110110 efa68cec9e99eca0bae88887ec80abeca097e58eadebac90eb9cb6
UHC 歷잙젺與쀫젗厭묐뜶 111001101011100010011111111010111010000010101101111001101010100010010111111010111010000010010011111001101111010010010001111010111000110110110100 e6b89feba0ade6a897eba093e6f491eb8db4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)