To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 葉??猥る??k?葉??猥る??k?B 100101110111010000111111001111111110000011001110100000101110100100111111001111111000001010001011001111111001011101110100001111110011111111100000110011101000001011101001001111110011111110000010100010110011111101000010 97743f3fe0ce82e93f3f828b3f97743f3fe0ce82e93f3f828b3f42
EUC-JP 葉??猥る??k?葉??猥る??k?B 110011011101010100111111001111111110000011010000101001001110101100111111001111111010001111101011001111111100110111010101001111110011111111100000110100001010010011101011001111110011111110100011111010110011111101000010 cdd53f3fe0d0a4eb3f3fa3eb3fcdd53f3fe0d0a4eb3f3fa3eb3f42
UTF-8 葉뗫젫猥る떧溜k젩葉뗫젫猥る떧溜k젩B 11101000100100011000100111101011100101111010101111101100101000001010101111100111100011001010010111100011100000101000101111101011100101101010011111101111101001111000101111101111101111011000101111101100101000001010100111101000100100011000100111101011100101111010101111101100101000001010101111100111100011001010010111100011100000101000101111101011100101101010011111101111101001111000101111101111101111011000101111101100101000001010100101000010 e89189eb97abeca0abe78ca5e3828beb96a7efa78befbd8beca0a9e89189eb97abeca0abe78ca5e3828beb96a7efa78befbd8beca0a942
UHC 葉뗫젫猥る떧溜k젩葉뗫젫猥る떧溜k젩B 11100111101010001000101111101011101000001010001111101000111001011010101011101011100010111011101011101010111111101010001111101011101000001010000111100111101010001000101111101011101000001010001111101000111001011010101011101011100010111011101011101010111111101010001111101011101000001010000101000010 e7a88beba0a3e8e5aaeb8bbaeafea3eba0a1e7a88beba0a3e8e5aaeb8bbaeafea3eba0a142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)