To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鬯ゥ蟷守ウ也ケ楪ア鬯ゥ蟷守ウ也ケ橸シ疑 111010011010110010101001111001011011100110001110111001111011001110010110111001111011100110011110110000101011000111101001101011001010100111100101101110011000111011100111101100111001011011100111101110011001111011101111101111001000101101011110 e9aca9e5b98ee7b396e7b99ec2b1e9aca9e5b98ee7b396e7b99eefbc8b5e
EUC-JP 鬯ゥ蟷守ウ也ケ楪ア鬯ゥ蟷守ウ也ケ橸シ疑 1111001010101110100011101010100111101010101110111011110011101001100011101011001111001100111010011000111010111001110111001100010010001110101100011111001010101110100011101010100111101010101110111011110011101001100011101011001111001100111010011000111010111001110111001111000110001110101111001011010110111111 f2ae8ea9eabbbce98eb3cce98eb9dcc48eb1f2ae8ea9eabbbce98eb3cce98eb9dcf18ebcb5bf
UTF-8 鬯ゥ蟷守ウ也ケ楪ア鬯ゥ蟷守ウ也ケ橸シ疑 111010011010110010101111111011111011110110101001111010001001111110110111111001011010111010001000111011111011110110110011111001001011100110011111111011111011110110111001111001101010010110101010111011111011110110110001111010011010110010101111111011111011110110101001111010001001111110110111111001011010111010001000111011111011110110110011111001001011100110011111111011111011110110111001111001101010100110111000111011111011110110111100111001111001011010010001 e9acafefbda9e89fb7e5ae88efbdb3e4b99fefbdb9e6a5aaefbdb1e9acafefbda9e89fb7e5ae88efbdb3e4b99fefbdb9e6a9b8efbdbce79691
UHC ???守?也??????守?也???疑 001111110011111100111111111000011111101000111111111001011010010100111111001111110011111100111111001111110011111111100001111110100011111111100101101001010011111100111111001111111110101111110111 3f3f3fe1fa3fe5a53f3f3f3f3f3fe1fa3fe5a53f3f3febf7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)