To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 臾?П???臾⑤た 1110010001101011001111111000010001010000001111110011111100111111111001000110101110000111010001001000001010111101 e46b3f84503f3f3fe46b874482bd
EUC-JP 臾?П???臾?た 11100111110011000011111110100111101100010011111100111111001111111110011111001100001111111010010010111111 e7cc3fa7b13f3f3fe7cc3fa4bf
UTF-8 臾븍П紐룸춡臾⑤た 1110100010000111101111101110101110111000100011011101000010011111111011111010011110001111111010111010001110111000111011001011011010100001111010001000011110111110111000101001000110100100111000111000000110011111 e887beebb88dd09fefa78feba3b8ecb6a1e887bee291a4e3819f
UHC 臾븍П紐룸춡臾⑤た 111010111010110010111010111010111010110010110001111010111010101010110111111010111010110110000010111010111010110010101000111010111010101010111111 ebacbaebacb1ebaab7ebad82ebaca8ebaabf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)