What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 邯懦失邏危、域ウー螳冗カ懦失邏危、域ウー讒真 11100111101101101001110011101101100011101011100011100111101101001000101011101011101001001000100011100110101100111011000011100101101011101000111111100111101101101001110011101101100011101011100011100111101101001000101011101011101001001000100011100110101100111011000011100110101001111001000001011110 e7b69ced8eb8e7b48aeba488e6b3b0e5ae8fe7b69ced8eb8e7b48aeba488e6b3b0e6a7905e
EUC-JP 邯懦失邏危、域ウー螳冗カ懦失邏危、域ウー讒真 1110111010111000110110001110111110111100101110101110111010110110101101001110110110001110101001001011000011101000100011101011001110001110101100001110101010110000101111101110100110001110101101101101100011101111101111001011101011101110101101101011010011101101100011101010010010110000111010001000111010110011100011101011000011101100101010011011111110111111 eeb8d8efbcbaeeb6b4ed8ea4b0e88eb38eb0eab0bee98eb6d8efbcbaeeb6b4ed8ea4b0e88eb38eb0eca9bfbf
UTF-8 邯懦失邏危、域ウー螳冗カ懦失邏危、域ウー讒真 111010011000001010101111111001101000011110100110111001011010010010110001111010011000001010001111111001011000110110110001111011111011110110100100111001011001111110011111111011111011110110110011111011111011110110110000111010001001111010110011111001011000011010010111111011111011110110110110111001101000011110100110111001011010010010110001111010011000001010001111111001011000110110110001111011111011110110100100111001011001111110011111111011111011110110110011111011111011110110110000111010001010111010010010111001111001110010011111 e982afe687a6e5a4b1e9828fe58db1efbda4e59f9fefbdb3efbdb0e89eb3e58697efbdb6e687a6e5a4b1e9828fe58db1efbda4e59f9fefbdb3efbdb0e8ae92e79c9f
UHC 邯懦失邏危?域??螳冗?懦失邏危?域??讒? 110010101111101111010001110101111110001111110111110101011010010011101010110010110011111111100110101101000011111100111111110100111101100111101001101101110011111111010001110101111110001111110111110101011010010011101010110010110011111111100110101101000011111100111111111100111101100000111111 cafbd1d7e3f7d5a4eacb3fe6b43f3fd3d9e9b73fd1d7e3f7d5a4eacb3fe6b43f3ff3d83f

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
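
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_report` and the mapping from the page's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions, and Python's codecs may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_report(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexa in encode_report("あ"):
        print(label, binary, hexa)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.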