To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???揄х?鵝?? 001111110011111100111111100111011000100110000100100001110011111111101010010000000011111100111111 3f3f3f9d8984873fea403f3f
EUC-JP ???揄х?鵝?? 001111110011111100111111110110011110100110100111111001110011111111110011101000010011111100111111 3f3f3fd9e9a7e73ff3a13f3f
UTF-8 琉쀭슀揄х씉鵝껇닄 1110111110100111100011001110110010000000101011011110110010001010100000001110011010001111100001001101000110000101111011001001010010001001111010011011010110011101111010101011101110000111111010111000101110000100 efa78cec80adec8a80e68f84d185ec9489e9b59deabb87eb8b84
UHC 琉쀭슀揄х씉鵝껇닄 111010111010010010010111111011011001101010010011111010101111000110101100111001111001110110100001111001001011110110000011111010001000100010001101 eba497ed9a93eaf1ace79da1e4bd83e8888d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)