To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 娼チ娼テ。ォョ。娼チ娼テ。ォョ。B 100011111010100111000001100011111010100111000011101000011111100010001111101010111010111010100001111100011010000111110010011010011000111110101001110000011000111110101001110000111010000111111000100011111010101110101110101000011111000110100001111100100110100101000010 8fa9c18fa9c3a1f88fabaea1f1a1f2698fa9c18fa9c3a1f88fabaea1f1a1f26942
EUC-JP 娼チ娼テ。?ォョ。??娼チ娼テ。?ォョ。??B 101111101010101110001110110000011011111010101011100011101100001110001110101000010011111110001110101010111000111010101110100011101010000100111111001111111011111010101011100011101100000110111110101010111000111011000011100011101010000100111111100011101010101110001110101011101000111010100001001111110011111101000010 beab8ec1beab8ec38ea13f8eab8eae8ea13f3fbeab8ec1beab8ec38ea13f8eab8eae8ea13f3f42
UTF-8 娼チ娼テ。ォョ。娼チ娼テ。ォョ。B 11100101101010001011110011101111101111101000000111100101101010001011110011101111101111101000001111101111101111011010000111101110100110001010111011101111101111011010101111101111101111011010111011101111101111011010000111101110100001001001110011101110100001101010000111100101101010001011110011101111101111101000000111100101101010001011110011101111101111101000001111101111101111011010000111101110100110001010111011101111101111011010101111101111101111011010111011101111101111011010000111101110100001001001110011101110100001101010000101000010 e5a8bcefbe81e5a8bcefbe83efbda1ee98aeefbdabefbdaeefbda1ee849cee86a1e5a8bcefbe81e5a8bcefbe83efbda1ee98aeefbdabefbdaeefbda1ee849cee86a142
UHC 娼?娼????????娼?娼????????B 111100111101111000111111111100111101111000111111001111110011111100111111001111110011111100111111001111111111001111011110001111111111001111011110001111110011111100111111001111110011111100111111001111110011111101000010 f3de3ff3de3f3f3f3f3f3f3f3ff3de3ff3de3f3f3f3f3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)