To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ????淫?エ???肉??純?????悠? 111001001110100010000010111010100011111100111111001111110011111110001000111110100011111110000011010001110011111100111111001111111001001111110111001111110011111110001111100000110011111100111111001111110011111100111111100101110100100100111111 e4e882ea3f3f3f3f88fa3f83473f3f3f93f73f3f8f833f3f3f3f3f97493f
EUC-JP 蒻れ????淫?エ孼??肉??純?????悠? 1110100011101010101001001110110000111111001111110011111100111111101100001111110000111111101001011010100010001111101110101100001100111111001111111100011011111001001111110011111110111101111000110011111100111111001111110011111100111111110011011010101000111111 e8eaa4ec3f3f3f3fb0fc3fa5a88fbac33f3fc6f93f3fbde33f3f3f3f3fcdaa3f
UTF-8 蒻れ슜留뺡뇡淫볦エ孼뽦넂肉욘깱純볧닡捻꿸낯悠쁁 111010001001001010111011111000111000001010001100111011001000101010011100111011111010011110001101111010111011101010100001111010111000011110100001111001101011011110101011111010111011001110100110111000111000001010101000111001011010110110111100111010111011110110100110111010111000010010000010111010001000001010001001111011001001101010011000111010101011100110110001111001111011010010010100111010111011001110100111111010111000101110100001111011111010011010100100111010101011111110111000111010111000001010101111111001101000001010100000111011001000000110000001 e892bbe3828cec8a9cefa78debbaa1eb87a1e6b7abebb3a6e382a8e5adbcebbda6eb8482e88289ec9a98eab9b1e7b494ebb3a7eb8ba1efa6a4eabfb8eb82afe682a0ec8181
UHC 蒻れ슜留뺡뇡淫볦エ孼뽦넂肉욘깱純볧닡捻꿸낯悠쁁 11100101101101101010101011101100100110101010100111101011101001111001010111101001100001111000100111101011111000101001001111101100101010111010100011100101111011011001011011100010100001101001001011101011101111111011111111100110100000111001111111100010111011011001001111101101100010001010000111100110111101111011001011101010101100111011100011101010111011011001100001000010 e5b6aaec9aa9eba795e98789ebe293ecaba8e5ed96e28692ebbfbfe6839fe2ed93ed88a1e6f7b2eab3b8eaed9842

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
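
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's standard codecs `cp932` and `cp949` correspond to SJIS-Win and UHC respectively, and that unencodable characters should be replaced with "?", as the table suggests. The names `CHARSETS` and `encode_table` are hypothetical, chosen for illustration.

```python
# Map the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("れ"):
        print(label, bits, hex_string)
```

For the single character "れ", this sketch yields `82ea` for SJIS-WIN, `a4ec` for EUC-JP, and `e3828c` for UTF-8, which agrees with the corresponding byte sequences in the table rows above.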