To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???????靈?伯???????靈?伯B 00111111001111110011111100111111001111110011111100111111111010001100101100111111100101001000110000111111001111110011111100111111001111110011111100111111111010001100101100111111100101001000110001000010 3f3f3f3f3f3f3fe8cb3f948c3f3f3f3f3f3f3fe8cb3f948c42
EUC-JP ???????靈?伯???????靈?伯B 00111111001111110011111100111111001111110011111100111111111100001100110100111111110001111110110000111111001111110011111100111111001111110011111100111111111100001100110100111111110001111110110001000010 3f3f3f3f3f3f3ff0cd3fc7ec3f3f3f3f3f3f3ff0cd3fc7ec42
UTF-8 렺씻렯렻렱렺셔靈셔伯렺씻렯렻렱렺셔靈셔伯B 11101011101000001011101011101100100101001011101111101011101000001010111111101011101000001011101111101011101000001011000111101011101000001011101011101100100001011001010011101001100111011000100011101100100001011001010011100100101111001010111111101011101000001011101011101100100101001011101111101011101000001010111111101011101000001011101111101011101000001011000111101011101000001011101011101100100001011001010011101001100111011000100011101100100001011001010011100100101111001010111101000010 eba0baec94bbeba0afeba0bbeba0b1eba0baec8594e99d88ec8594e4bcafeba0baec94bbeba0afeba0bbeba0b1eba0baec8594e99d88ec8594e4bcaf42
UHC 렺씻렯렻렱렺셔靈셔伯렺씻렯렻렱렺셔靈셔伯B 1000111011000010101111101100010010001110101111001000111011000011100011101011111010001110110000101011110011000101110101101100010010111100110001011101101111010111100011101100001010111110110001001000111010111100100011101100001110001110101111101000111011000010101111001100010111010110110001001011110011000101110110111101011101000010 8ec2bec48ebc8ec38ebe8ec2bcc5d6c4bcc5dbd78ec2bec48ebc8ec38ebe8ec2bcc5d6c4bcc5dbd742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)