What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣????? 00111111001111110011111110001011100000110011111100111111001111110011111100111111 3f3f3f8b833f3f3f3f3f
EUC-JP ???泣????? 00111111001111110011111110110101111000110011111100111111001111110011111100111111 3f3f3fb5e33f3f3f3f3f
UTF-8 裂╉굩泣덂쮦琉룸돐 111011111010011010100000111000101001010110001001111010101011010110101001111001101011001110100011111010111000110110000010111011001010111010100110111011111010011110001100111010111010001110111000111010111000111110010000 efa6a0e29589eab5a9e6b3a3eb8d82ecaea6efa78ceba3b8eb8f90
UHC 裂╉굩泣덂쮦琉룸돐 111001101111000110100110111000111000001010001111111010111110100010001000111001011010100010000011111010111010010010110111111010111011010110111011 e6f1a6e3828febe888e5a883eba4b7ebb5bb

SJIS-win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
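
The conversion shown in the table above can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the table's labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption that may differ from the original converter for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the 3f bytes in the table look like.

```python
# Assumed mapping from the table's labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("泣").items():
        print(label, bits, hexstr)
```

For the single character 泣, this yields e6b3a3 in UTF-8, 8b83 in CP932, and b5e3 in EUC-JP, which agrees with the corresponding bytes in the table rows above.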