What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 娼チ「将タ。娼チ「将タ。B 10001111101010011100000110100010111101111111101110001111101010111100000010100001111100001111101110001111101010011100000110100010111101111111101110001111101010111100000010100001111100001111101101000010 8fa9c1a2f7fb8fabc0a1f0fb8fa9c1a2f7fb8fabc0a1f0fb42
EUC-JP 娼チ「?将タ。?娼チ「?将タ。?B 1011111010101011100011101100000110001110101000100011111110111110101011011000111011000000100011101010000100111111101111101010101110001110110000011000111010100010001111111011111010101101100011101100000010001110101000010011111101000010 beab8ec18ea23fbead8ec08ea13fbeab8ec18ea23fbead8ec08ea13f42
UTF-8 娼チ「将タ。娼チ「将タ。B 11100101101010001011110011101111101111101000000111101111101111011010001011101110100101111001111011100101101100001000011011101111101111101000000011101111101111011010000111101110100000101011101011100101101010001011110011101111101111101000000111101111101111011010001011101110100101111001111011100101101100001000011011101111101111101000000011101111101111011010000111101110100000101011101001000010 e5a8bcefbe81efbda2ee979ee5b086efbe80efbda1ee82bae5a8bcefbe81efbda2ee979ee5b086efbe80efbda1ee82ba42
UHC 娼???????娼???????B 11110011110111100011111100111111001111110011111100111111001111110011111111110011110111100011111100111111001111110011111100111111001111110011111101000010 f3de3f3f3f3f3f3f3ff3de3f3f3f3f3f3f3f42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
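
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter behind this page: the function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the ISO-8859-1 row. Individual characters may still map to different bytes than in the table, because codec implementations differ in their vendor extensions.

```python
def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("B", codec)
        print(label, binary, hexadecimal)
```

For the ASCII letter `B`, every charset listed yields the single byte `42` (`01000010`), which is the final byte of each row in the table. Differences between charsets only appear for non-ASCII characters.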