To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 陬晤ュ戎陬晤ュ」}v陬晤ュ戎陬晤ュ」}vB 11101000101000111001110111101011101011011000111101011110111010001010001110011101111010111010110110100011011111010111011011101000101000111001110111101011101011011000111101011110111010001010001110011101111010111010110110100011011111010111011001000010 e8a39debad8f5ee8a39debada37d76e8a39debad8f5ee8a39debada37d7642
EUC-JP 陬晤ュ戎陬晤ュ」}v陬晤ュ戎陬晤ュ」}vB 11110000101001011101101011101101100011101010110110111101101111111111000010100101110110101110110110001110101011011000111010100011011111010111011011110000101001011101101011101101100011101010110110111101101111111111000010100101110110101110110110001110101011011000111010100011011111010111011001000010 f0a5daed8eadbdbff0a5daed8ead8ea37d76f0a5daed8eadbdbff0a5daed8ead8ea37d7642
UTF-8 陬晤ュ戎陬晤ュ」}v陬晤ュ戎陬晤ュ」}vB 1110100110011001101011001110011010011001101001001110111110111101101011011110011010001000100011101110100110011001101011001110011010011001101001001110111110111101101011011110111110111101101000110111110101110110111010011001100110101100111001101001100110100100111011111011110110101101111001101000100010001110111010011001100110101100111001101001100110100100111011111011110110101101111011111011110110100011011111010111011001000010 e999ace699a4efbdade6888ee999ace699a4efbdadefbda37d76e999ace699a4efbdade6888ee999ace699a4efbdadefbda37d7642
UHC ?晤?戎?晤??}v?晤?戎?晤??}vB 001111111110011111111011001111111110101111010100001111111110011111111011001111110011111101111101011101100011111111100111111110110011111111101011110101000011111111100111111110110011111100111111011111010111011001000010 3fe7fb3febd43fe7fb3f3f7d763fe7fb3febd43fe7fb3f3f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)