To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 嶸??饒??艶{┬嶸??饒??艶{┬B 1111101010110100001111110011111111101001011000000011111100111111100010011001000010000001011011111000010010100110111110101011010000111111001111111110100101100000001111110011111110001001100100001000000101101111100001001010011001000010 fab43f3fe9603f3f8990816f84a6fab43f3fe9603f3f8990816f84a642
EUC-JP 嶸??饒??艶{┬嶸??饒??艶{┬B 10001111101110111111010000111111001111111111000111000001001111110011111110110001111100001010000111010000101010001010100010001111101110111111010000111111001111111111000111000001001111110011111110110001111100001010000111010000101010001010100001000010 8fbbf43f3ff1c13f3fb1f0a1d0a8a88fbbf43f3ff1c13f3fb1f0a1d0a8a842
UTF-8 嶸뤹왃饒뤹컮艶{┬嶸뤹왃饒뤹컮艶{┬B 11100101101101101011100011101011101001001011100111101100100110011000001111101001101001011001001011101011101001001011100111101100101110111010111011101000100010011011011011101111101111011001101111100010100101001010110011100101101101101011100011101011101001001011100111101100100110011000001111101001101001011001001011101011101001001011100111101100101110111010111011101000100010011011011011101111101111011001101111100010100101001010110001000010 e5b6b8eba4b9ec9983e9a592eba4b9ecbbaee889b6efbd9be294ace5b6b8eba4b9ec9983e9a592eba4b9ecbbaee889b6efbd9be294ac42
UHC 嶸뤹왃饒뤹컮艶{┬嶸뤹왃饒뤹컮艶{┬B 11100111101011101000111111100111100111101011011011101001101011101000111111100111101100001001010011100110111111011010001111111011101001101010100011100111101011101000111111100111100111101011011011101001101011101000111111100111101100001001010011100110111111011010001111111011101001101010100001000010 e7ae8fe79eb6e9ae8fe7b094e6fda3fba6a8e7ae8fe79eb6e9ae8fe7b094e6fda3fba6a842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)