To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淫??淫??淫??沃 1000100011111010001111110011111110001000111110100011111100111111100010001111101000111111001111111001011110000000 88fa3f3f88fa3f3f88fa3f3f9780
EUC-JP 淫??淫??淫??沃 1011000011111100001111110011111110110000111111000011111100111111101100001111110000111111001111111100110111100000 b0fc3f3fb0fc3f3fb0fc3f3fcde0
UTF-8 淫몄꽎淫ㅵ젇淫욎꽕沃 111001101011011110101011111010111010101010000100111010101011110110001110111001101011011110101011111000111000010110110101111011001010000010000111111001101011011110101011111011001001101010001110111010101011110110010101111001101011001010000011 e6b7abebaa84eabd8ee6b7abe385b5eca087e6b7abec9a8eeabd95e6b283
UHC 淫몄꽎淫ㅵ젇淫욎꽕沃 1110101111100010101110001110110010000100100111101110101111100010101001001110010110100000100010101110101111100010100111101110110010000100101001001110100010101010 ebe2b8ec849eebe2a4e5a08aebe29eec84a4e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)