To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 航??亨???i? 100011010111000100111111001111111000101110011100001111110011111100111111100000101000100100111111 8d713f3f8b9c3f3f3f82893f
EUC-JP 航??亨???i? 101110011101001000111111001111111011010111111100001111110011111100111111101000111110100100111111 b9d23f3fb5fc3f3f3fa3e93f
UTF-8 航ㅵ뤀亨꿨ㅁ降i눈 111010001000100010101010111000111000010110110101111010111010010010000000111001001011101010101000111010101011111110101000111000111000010110000001111011111010100010001001111011111011110110001001111010111000100010001000 e888aae385b5eba480e4baa8eabfa8e38581efa889efbd89eb8888
UHC 航ㅵ뤀亨꿨ㅁ降i눈 111110011111111010100100111001011000111110110001111110101111101110110010111001011010010010110001111110101010001010100011111010011011010010101011 f9fea4e58fb1fafbb2e5a4b1faa2a3e9b4ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)