To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????}B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7d42
SJIS-WIN ????????????????????}B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7d42
EUC-JP ????????????????????}B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7d42
UTF-8 렻렜렺렊렺셍렚렻렟렺렎렻매렕렺셍렎렺셍렮}B 1110101110100000101110111110101110100000100111001110101110100000101110101110101110100000100010101110101110100000101110101110110010000101100011011110101110100000100110101110101110100000101110111110101110100000100111111110101110100000101110101110101110100000100011101110101110100000101110111110101110100111101001001110101110100000100101011110101110100000101110101110110010000101100011011110101110100000100011101110101110100000101110101110110010000101100011011110101110100000101011100111110101000010 eba0bbeba09ceba0baeba08aeba0baec858deba09aeba0bbeba09feba0baeba08eeba0bbeba7a4eba095eba0baec858deba08eeba0baec858deba0ae7d42
UHC 렻렜렺렊렺셍렚렻렟렺렎렻매렕렺셍렎렺셍렮}B 100011101100001110001110101011101000111011000010100011101010000110001110110000101011110011000100100011101010110110001110110000111000111010110000100011101100001010001110101001001000111011000011101110001100010110001110101010101000111011000010101111001100010010001110101001001000111011000010101111001100010010001110101110110111110101000010 8ec38eae8ec28ea18ec2bcc48ead8ec38eb08ec28ea48ec3b8c58eaa8ec2bcc48ea48ec2bcc48ebb7d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)