To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 咫?????逡???咫?????逡???^ 10011010010000000011111100111111001111110011111100111111111001111001010100111111001111110011111110011010010000000011111100111111001111110011111100111111111001111001010100111111001111110011111101011110 9a403f3f3f3f3fe7953f3f3f9a403f3f3f3f3fe7953f3f3f5e
EUC-JP 咫?????逡???咫?????逡???^ 11010011101000010011111100111111001111110011111100111111111011011111010100111111001111110011111111010011101000010011111100111111001111110011111100111111111011011111010100111111001111110011111101011110 d3a13f3f3f3f3fedf53f3f3fd3a13f3f3f3f3fedf53f3f3f5e
UTF-8 咫렧理면렧렱逡淚렏렕咫렧理면렧렱逡淚렏렕^ 11100101100100101010101111101011101000001010011111101111101001111010010011101011101010011011010011101011101000001010011111101011101000001011000111101001100000001010000111101111101001011000110111101011101000001000111111101011101000001001010111100101100100101010101111101011101000001010011111101111101001111010010011101011101010011011010011101011101000001010011111101011101000001011000111101001100000001010000111101111101001011000110111101011101000001000111111101011101000001001010101011110 e592abeba0a7efa7a4eba9b4eba0a7eba0b1e980a1efa58deba08feba095e592abeba0a7efa7a4eba9b4eba0a7eba0b1e980a1efa58deba08feba0955e
UHC 咫렧理면렧렱逡淚렏렕咫렧理면렧렱逡淚렏렕^ 1111001010100001100011101011011011101100101101011011100011101001100011101011011010001110101111101111000111100100110100101110011110001110101001011000111010101010111100101010000110001110101101101110110010110101101110001110100110001110101101101000111010111110111100011110010011010010111001111000111010100101100011101010101001011110 f2a18eb6ecb5b8e98eb68ebef1e4d2e78ea58eaaf2a18eb6ecb5b8e98eb68ebef1e4d2e78ea58eaa5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)