What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????\??????\?????B 0011111100111111001111110011111100111111001111110011111100111111001111110101110000111111001111110011111100111111001111110011111101011100001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f5c3f3f3f3f3f3f5c3f3f3f3f3f42
SJIS-WIN ???似?????\???】??\?????B 00111111001111110011111110001110100101110011111100111111001111110011111100111111010111000011111100111111001111111000000101111010001111110011111101011100001111110011111100111111001111110011111101000010 3f3f3f8e973f3f3f3f3f5c3f3f3f817a3f3f5c3f3f3f3f3f42
EUC-JP ???似?????\???】??\?????B 00111111001111110011111110111011111101110011111100111111001111110011111100111111010111000011111100111111001111111010000111011011001111110011111101011100001111110011111100111111001111110011111101000010 3f3f3fbbf73f3f3f3f3f5c3f3f3fa1db3f3f5c3f3f3f3f3f42
UTF-8 렻렎렺似씻씻슨렻렓\렺봬걜】렻렓\렺셔렚렺슝B 111010111010000010111011111010111010000010001110111010111010000010111010111001001011110010111100111011001001010010111011111011001001010010111011111011001000101010101000111010111010000010111011111010111010000010010011010111001110101110100000101110101110101110110100101011001110101010110001100111001110001110000000100100011110101110100000101110111110101110100000100100110101110011101011101000001011101011101100100001011001010011101011101000001001101011101011101000001011101011101100100010101001110101000010 eba0bbeba08eeba0bae4bcbcec94bbec94bbec8aa8eba0bbeba0935ceba0baebb4aceab19ce38091eba0bbeba0935ceba0baec8594eba09aeba0baec8a9d42
UHC 렻렎렺似씻씻슨렻렓\렺봬걜】렻렓\렺셔렚렺슝B 10001110110000111000111010100100100011101100001011011110110001001011111011000100101111101100010010111101101111001000111011000011100011101010100001011100100011101100001010111010110001001011000011000100101000011011110110001110110000111000111010101000010111001000111011000010101111001100010110001110101011011000111011000010101111011011100101000010 8ec38ea48ec2dec4bec4bec4bdbc8ec38ea85c8ec2bac4b0c4a1bd8ec38ea85c8ec2bcc58ead8ec2bdb942

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
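
The conversion shown in the table can be reproduced roughly as follows. This is only a minimal sketch in Python, not the code behind this page: the names `CODECS` and `bit_strings` are made up for illustration, and the mapping from the charset labels to Python codec names (in particular UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears consistent with the runs of `3f` in the rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Print one row per charset for a single sample character.
for name, codec in CODECS.items():
    binary, hexadecimal = bit_strings("似", codec)
    print(name, binary, hexadecimal)
```

For the character 似, this should give the same byte values as the corresponding positions in the table: `8e97` for SJIS-WIN, `bbf7` for EUC-JP, and `e4bcbc` for UTF-8.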