What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????惟??を?欲?忿?袁?咀醍?肢??B 001111110011111100111111001111111000100011010010001111110011111110000010111100000011111110010111011111100011111110011100011111000011111111100101110011010011111110011001111100001001000111100111001111111000111010001000001111110011111101000010 3f3f3f3f88d23f3f82f03f977e3f9c7c3fe5cd3f99f091e73f8e883f3f42
EUC-JP ????惟??を?欲?忿?袁?咀醍?肢??B 001111110011111100111111001111111011000011010100001111110011111110100100111100100011111111001101110111110011111111010111110111010011111111101010110011110011111111010010111100101100001011101001001111111011101111101000001111110011111101000010 3f3f3f3fb0d43f3fa4f23fcddf3fd7dd3feacf3fd2f2c2e93fbbe83f3f42
UTF-8 뤯헤ㅺ씨惟븟탮を떤欲핊忿렎袁얘咀醍렕肢꿴걋B 11101011101001001010111111101101100101111010010011100011100001011011101011101100100101001010100011100110100000111001111111101011101110001001111111101101100000111010111011100011100000101001001011101011100101101010010011100110101011001011001011101101100101011000101011100101101111111011111111101011101000001000111011101000101000101000000111101100100101101001100011100101100100101000000011101001100001101000110111101011101000001001010111101000100000101010001011101010101111111011010011101010101100011000101101000010 eba4afed97a4e385baec94a8e6839febb89fed83aee38292eb96a4e6acb2ed958ae5bfbfeba08ee8a281ec9698e59280e9868deba095e882a2eabfb4eab18b42
UHC 뤯헤ㅺ씨惟븟탮を떤欲핊忿렎袁얘咀醍렕肢꿴걋B 10001111110111011100011111101100101001001110101010111110101111101110101011101110101110101111000010110101100011101010101011110010101101101011001011101001101100001100000010001111110111011100100010001110101001001110101010111110101111101110101011101110101110101111000010110101100011101010101011110010101101101011001011101001101100001100000001000010 8fddc7eca4eabebeeaeebaf0b58eaaf2b6b2e9b0c08fddc88ea4eabebeeaeebaf0b58eaaf2b6b2e9b0c042

SJIS-win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
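
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper names `encode_bits` and `convert` are hypothetical, and the mapping of the page's charset labels to Python codec names (`cp932` for SJIS-win, `cp949` for UHC) is an assumption. Python's codecs may differ from this page's converter for some vendor-specific characters.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("をB").items():
        print(label, binary, hexadecimal)
```

For example, `convert("を")["EUC-JP"]` yields the hexadecimal string `a4f2`, which matches the bytes for を in the EUC-JP row of the table.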