What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??泣????? 1110001110100000001111110011111110001011100000110011111100111111001111110011111100111111 e3a03f3f8b833f3f3f3f3f
EUC-JP 罌??泣????? 1110011010100010001111110011111110110101111000110011111100111111001111110011111100111111 e6a23f3fb5e33f3f3f3f3f
UTF-8 罌븐옋泣뉑떤栒룹뭘 111001111011110110001100111010111011100010010000111011001001100010001011111001101011001110100011111010111000100110010001111010111001011010100100111001101010000010010010111010111010001110111001111010111010110110011000 e7bd8cebb890ec988be6b3a3eb8991eb96a4e6a092eba3b9ebad98
UHC 罌븐옋泣뉑떤栒룹뭘 111001011010001010111010111011001001111010010011111010111110100010000111111001101011011010110010111000101110001110110111111011001011100110111011 e5a2baec9e93ebe887e6b6b2e2e3b7ecb9bb
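The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the labels above to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the 3f bytes in the rows above suggest; Python's codecs may not agree with this page byte for byte on every character.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("泣").items():
        print(label, bits, hex_string)
```

For the single character 泣, this sketch yields 8b83 in SJIS-WIN, b5e3 in EUC-JP, and e6b3a3 in UTF-8, the same byte sequences that appear for that character in the rows above.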

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).