What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????踰??を?欲?忿?袁??醍?肢??B 0011111100111111001111110011111111100110111110100011111100111111100000101111000000111111100101110111111000111111100111000111110000111111111001011100110100111111001111111001000111100111001111111000111010001000001111110011111101000010 3f3f3f3fe6fa3f3f82f03f977e3f9c7c3fe5cd3f3f91e73f8e883f3f42
EUC-JP ????踰??を?欲?忿?袁??醍?肢??B 0011111100111111001111110011111111101100111111000011111100111111101001001111001000111111110011011101111100111111110101111101110100111111111010101100111100111111001111111100001011101001001111111011101111101000001111110011111101000010 3f3f3f3fecfc3f3fa4f23fcddf3fd7dd3feacf3f3fc2e93fbbe83f3f42
UTF-8 뤯헤ㅺ씨踰븟탮を떤欲핊忿렎袁얜껼醍렕肢꿴걋B 11101011101001001010111111101101100101111010010011100011100001011011101011101100100101001010100011101000101110001011000011101011101110001001111111101101100000111010111011100011100000101001001011101011100101101010010011100110101011001011001011101101100101011000101011100101101111111011111111101011101000001000111011101000101000101000000111101100100101101001110011101010101110111011110011101001100001101000110111101011101000001001010111101000100000101010001011101010101111111011010011101010101100011000101101000010 eba4afed97a4e385baec94a8e8b8b0ebb89fed83aee38292eb96a4e6acb2ed958ae5bfbfeba08ee8a281ec969ceabbbce9868deba095e882a2eabfb4eab18b42
UHC 뤯헤ㅺ씨踰븟탮を떤欲핊忿렎袁얜껼醍렕肢꿴걋B 10001111110111011100011111101100101001001110101010111110101111101110101110110010101110101111000010110101100011101010101011110010101101101011001011101001101100001100000010001111110111011100100010001110101001001110101010111110101111101110101110110010101110101111000010110101100011101010101011110010101101101011001011101001101100001100000001000010 8fddc7eca4eabebeebb2baf0b58eaaf2b6b2e9b0c08fddc88ea4eabebeebb2baf0b58eaaf2b6b2e9b0c042

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
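
The conversion in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents of the charset labels above, and the helper names `encode_to_bits` and `conversion_table` are made up for illustration. It also assumes that a character with no representation in a charset is replaced by "?" (byte 3f), which is consistent with the 3f runs in the ISO-8859-1 row.

```python
# Charset labels from the table mapped to Python codec names.
# These pairings are assumptions; the page's own converter may differ
# for a few vendor-specific characters.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def conversion_table(text):
    """Return one (charset, binary, hex) row per charset in CODECS."""
    return [(label, *encode_to_bits(text, codec))
            for label, codec in CODECS.items()]


if __name__ == "__main__":
    for label, binary, hexed in conversion_table("をB"):
        print(label, binary, hexed)
```

For example, `encode_to_bits("を", "utf-8")` yields the hex string `e38292`, the same three bytes that appear for that character in the UTF-8 row above, while the SJIS-WIN and EUC-JP rows show it as `82f0` and `a4f2`.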