To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????O 00111111001111110011111100111111001111110011111100111111001111110011111101001111 3f3f3f3f3f3f3f3f3f4f
SJIS-WIN ??????儒??O 0011111100111111001111110011111100111111001111111000111011110010001111110011111101001111 3f3f3f3f3f3f8ef23f3f4f
EUC-JP ???堉??儒??O 00111111001111110011111110001111101101111111110100111111001111111011110011110100001111110011111101001111 3f3f3f8fb7fd3f3fbcf43f3f4f
UTF-8 療낆늹堉삼쮬儒묒쑠O 11101111101001111000000111101011100000101000011011101011100010101011100111100101101000001000100111101100100000101011110011101100101011101010110011100101100001001001001011101011101011001001001011101100100100011010000001001111 efa781eb8286eb8ab9e5a089ec82bcecaeace58492ebac92ec91a04f
UHC 療낆늹堉삼쮬儒묒쑠O 11101000111111101000010111101100100010001000001011101011101111001011101111101111101010001000100111101010111000111001000111101100100111001011111101001111 e8fe85ec8882ebbcbbefa889eae391ec9cbf4f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)