To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??溢??儀?? 111000011001111100111111001111111000100011101100001111110011111110001011010101100011111100111111 e19f3f3f88ec3f3f8b563f3f
EUC-JP 癲??溢??儀?? 111000101010000100111111001111111011000011101110001111110011111110110101101101110011111100111111 e2a13f3fb0ee3f3fb5b73f3f
UTF-8 癲쀫쪋溢욑쭕儀숈춷 111001111001100110110010111011001000000010101011111011001010101010001011111001101011101010100010111011001001101010010001111011001010110110010101111001011000010010000000111011001000100010001000111011001011011010110111 e799b2ec80abecaa8be6baa2ec9a91ecad95e58480ec8888ecb6b7
UHC 癲쀫쪋溢욑쭕儀숈춷 111011111010011010010111111010111010010110000101111011001110111010011110111011111010011110001101111010111111000010011001111011001010110110010011 efa697eba585ecee9eefa78debf099ecad93

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)