To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??宗??壬??????壬???????壬?? 001111110011111110001111010000000011111100111111100100000111000000111111001111110011111100111111001111110011111110010000011100000011111100111111001111110011111100111111001111110011111110010000011100000011111100111111 3f3f8f403f3f90703f3f3f3f3f3f90703f3f3f3f3f3f3f90703f3f
EUC-JP ??宗??壬??????壬???????壬?? 001111110011111110111101101000010011111100111111101111111101000100111111001111110011111100111111001111110011111110111111110100010011111100111111001111110011111100111111001111110011111110111111110100010011111100111111 3f3fbda13f3fbfd13f3f3f3f3f3fbfd13f3f3f3f3f3f3fbfd13f3f
UTF-8 셈석宗렯렚壬렯렚렯렍셈섞壬렯렚렯롕렱렶섈壬렯렚 111011001000010110001000111011001000010010011101111001011010111010010111111010111010000010101111111010111010000010011010111001011010001110101100111010111010000010101111111010111010000010011010111010111010000010101111111010111010000010001101111011001000010110001000111011001000010010011110111001011010001110101100111010111010000010101111111010111010000010011010111010111010000010101111111010111010000110010101111010111010000010110001111010111010000010110110111011001000010010001000111001011010001110101100111010111010000010101111111010111010000010011010 ec8588ec849de5ae97eba0afeba09ae5a3aceba0afeba09aeba0afeba08dec8588ec849ee5a3aceba0afeba09aeba0afeba195eba0b1eba0b6ec8488e5a3aceba0afeba09a
UHC 셈석宗렯렚壬렯렚렯렍셈섞壬렯렚렯롕렱렶섈壬렯렚 10111100110000001011110010101110111100001111001110001110101111001000111010101101111011001111001110001110101111001000111010101101100011101011110010001110101000111011110011000000101111001010111111101100111100111000111010111100100011101010110110001110101111001000111011011001100011101011111010001110110000011011110010101010111011001111001110001110101111001000111010101101 bcc0bcaef0f38ebc8eadecf38ebc8ead8ebc8ea3bcc0bcafecf38ebc8ead8ebc8ed98ebe8ec1bcaaecf38ebc8ead

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)