To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蒻れ??f?宥??嚥〓?愉??邑???l?^ 11100100111010001000001011101010001111110011111110000010100001100011111110010111010001110011111100111111100110101000101110000001101011000011111110010110111110010011111100111111100101110101011100111111001111110011111110000010100011000011111101011110 e4e882ea3f3f82863f97473f3f9a8b81ac3f96f93f3f97573f3f3f828c3f5e
EUC-JP 蒻れ??f?宥??嚥〓?愉??邑???l?^ 11101000111010101010010011101100001111110011111110100011111001100011111111001101101010000011111100111111110100111110101110100010101011100011111111001100111110110011111100111111110011011011100000111111001111110011111110100011111011000011111101011110 e8eaa4ec3f3fa3e63fcda83f3fd3eba2ae3fccfb3f3fcdb83f3f3fa3ec3f5e
UTF-8 蒻れ슦杻f룚宥밸룆嚥〓끃愉뚦슖邑뀁뒃力l뇹^ 11101000100100101011101111100011100000101000110011101100100010101010011011101111101001111000100011101111101111011000011011101011101000111001101011100101101011101010010111101011101100001011100011101011101000111000011011100101100110101010010111100011100000001001001111101011100000011000001111100110100001001000100111101011100110101010011011101100100010101001011011101001100000101001000111101011100000001000000111101011100100101000001111101111101001101000101011101111101111011000110011101011100001111011100101011110 e892bbe3828cec8aa6efa788efbd86eba39ae5aea5ebb0b8eba386e59aa5e38093eb8183e68489eb9aa6ec8a96e98291eb8081eb9283efa68aefbd8ceb87b95e
UHC 蒻れ슦杻f룚宥밸룆嚥〓끃愉뚦슖邑뀁뒃力l뇹^ 11100101101101101010101011101100100110101011000011101010111101001010001111100110100011111001011011101010111010011011100111101011100011111000010111100110101111111010000111101011100001011011100111101010111100001000110011100101100110101010010111101011111010011011001011101100100010101000000111100110101100111010001111101100101101001010011001011110 e5b6aaec9ab0eaf4a3e68f96eae9b9eb8f85e6bfa1eb85b9eaf08ce59aa5ebe9b2ec8a81e6b3a3ecb4a65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)