To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 淫???物鬱?諍? 10001000111110100011111100111111001111111001010110101000100111110101010000111111111001100111100100111111 88fa3f3f3f95a89f543fe6793f
EUC-JP 淫???物鬱?諍? 10110000111111000011111100111111001111111100101010101010110111011011010100111111111010111101101000111111 b0fc3f3f3fcaaaddb53febda3f
UTF-8 淫면렗띕物鬱렩諍렧 111001101011011110101011111010111010100110110100111010111010000010010111111010111001110110010101111001111000100110101001111010011010110010110001111010111010000010101001111010001010101110001101111010111010000010100111 e6b7abeba9b4eba097eb9d95e789a9e9acb1eba0a9e8ab8deba0a7
UHC 淫면렗띕物鬱렩諍렧 111010111110001010111000111010011000111010101100101101101110101111011010101010101110101010100110100011101011011111101110101101011000111010110110 ebe2b8e98eacb6ebdaaaeaa68eb7eeb58eb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)