To which bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????宋?? 00111111001111110011111100111111001111110011111110010001011101100011111100111111 3f3f3f3f3f3f91763f3f
EUC-JP ??????宋?Œ 001111110011111100111111001111110011111100111111110000011101011100111111100011111010100110101101 3f3f3f3f3f3fc1d73f8fa9ad
UTF-8 聯뤿씭留⒵쾮宋믪Œ 1110111110100110100101111110101110100100101111111110110010010100101011011110111110100111100011011110001010010010101101011110110010111110101011101110010110101110100010111110101110101111101010101100010110010010 efa697eba4bfec94adefa78de292b5ecbeaee5ae8bebafaac592
UHC 聯뤿씭留⒵쾮宋믪Œ 111001101110000110001111111010111001110110111110111010111010011110101001111001101011001010000101111000011110010010010010111011001010100010101011 e6e18feb9dbeeba7a9e6b285e1e492eca8ab

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
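
A conversion of this kind can be approximated in a few lines of Python. This is only a minimal sketch, not the tool's actual implementation: the function name encode_table and the mapping from the charset labels to Python codec names are assumptions (in particular, UHC is assumed to correspond to Python's cp949 codec). Characters that a charset cannot represent are replaced with "?", which appears as 3f in the hexadecimal column.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
# SJIS-Win = CP932 is stated on the page; UHC -> cp949 is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("宋"):
        print(label, bits, hex_string)
```

For the single character 宋, the SJIS-WIN, EUC-JP, and UTF-8 rows should contain the same byte sequences that appear for that character in the table above (9176, c1d7, and e5ae8b).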