To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 巽他俗竪息村巽続則 100100100100011010010001101111001001000110101101100100100100011110010001101001111001000110111010100100100100011010010001101100011001000110100101 924691bc91ad924791a791ba924691b191a5
EUC-JP 巽他俗竪息村巽続則 110000111010011111000010101111101100001010101111110000111010100011000010101010011100001010111100110000111010011111000010101100111100001010100111 c3a7c2bec2afc3a8c2a9c2bcc3a7c2b3c2a7
UTF-8 巽他俗竪息村巽続則 111001011011011110111101111001001011101110010110111001001011111110010111111001111010101110101010111001101000000110101111111001101001110110010001111001011011011110111101111001111011011010011010111001011000100110000111 e5b7bde4bb96e4bf97e7abaae681afe69d91e5b7bde7b69ae58987
UHC 巽他俗竪息村巽?則 1110000111011110111101101110001011100001110101001110001010110101111000111101001111110101101111011110000111011110001111111111011011001110 e1def6e2e1d4e2b5e3d3f5bde1de3ff6ce

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)