To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 愛??娃??染??? 10001000101001000011111100111111100010001010000100111111001111111001000011110101001111110011111100111111 88a43f3f88a13f3f90f53f3f3f
EUC-JP 愛??娃??染??? 10110000101001100011111100111111101100001010001100111111001111111100000011110111001111110011111100111111 b0a63f3fb0a33f3fc0f73f3f3f
UTF-8 愛곷졁娃뗫젽染볝끆溜 111001101000010010011011111010101011001110110111111011001010000110000001111001011010100010000011111010111001011110101011111011001010000010111101111001101001111110010011111010111011001110011101111010111000000110000110111011111010011110001011 e6849beab3b7eca181e5a883eb97abeca0bde69f93ebb39deb8186efa78b
UHC 愛곷졁娃뗫젽染볝끆溜 1110010011110001100000011110101110100000101100101110100011011111100010111110101110100000101011111110011011111000100100111110001110000101101110101110101011111110 e4f181eba0b2e8df8beba0afe6f893e385baeafe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)