To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 厭??獄れ?洵? 100010010111110100111111001111111000110110010110100000101110101000111111100111111010101100111111 897d3f3f8d9682ea3f9fab3f
EUC-JP 厭??獄れ?洵? 101100011101111000111111001111111011100111110110101001001110110000111111110111101010110100111111 b1de3f3fb9f6a4ec3fdead3f
UTF-8 厭묐젎獄れ뇯洵쯓 111001011000111010101101111010111010110010010000111011001010000010001110111001111000110110000100111000111000001010001100111010111000011110101111111001101011010010110101111011001010111110010011 e58eadebac90eca08ee78d84e3828ceb87afe6b4b5ecaf93
UHC 厭묐젎獄れ뇯洵쯓 11100110111101001001000111101011101000001000111111101000101010111010101011101100100001111001010011100010111001111010100101001111 e6f491eba08fe8abaaec8794e2e7a94f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)