To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????un}?????????un{^ 00111111001111110011111100111111001111110011111100111111001111110011111101110101011011100111110100111111001111110011111100111111001111110011111100111111001111110011111101110101011011100111101101011110 3f3f3f3f3f3f3f3f3f756e7d3f3f3f3f3f3f3f3f3f756e7b5e
SJIS-WIN ?????????un}?????????un{^ 00111111001111110011111100111111001111110011111100111111001111110011111101110101011011100111110100111111001111110011111100111111001111110011111100111111001111110011111101110101011011100111101101011110 3f3f3f3f3f3f3f3f3f756e7d3f3f3f3f3f3f3f3f3f756e7b5e
EUC-JP ?????????un}?????????un{^ 00111111001111110011111100111111001111110011111100111111001111110011111101110101011011100111110100111111001111110011111100111111001111110011111100111111001111110011111101110101011011100111101101011110 3f3f3f3f3f3f3f3f3f756e7d3f3f3f3f3f3f3f3f3f756e7b5e
UTF-8 챔혵혳채쨍혷챔혴짚un}챔혵혳채쨍혷챔혴짚un{^ 11101100101100011001010011101101100110001011010111101101100110001011001111101100101100011000010011101100101010001000110111101101100110001011011111101100101100011001010011101101100110001011010011101100101001111001101001110101011011100111110111101100101100011001010011101101100110001011010111101101100110001011001111101100101100011000010011101100101010001000110111101101100110001011011111101100101100011001010011101101100110001011010011101100101001111001101001110101011011100111101101011110 ecb194ed98b5ed98b3ecb184eca88ded98b7ecb194ed98b4eca79a756e7decb194ed98b5ed98b3ecb184eca88ded98b7ecb194ed98b4eca79a756e7b5e
UHC 챔혵혳채쨍혷챔혴짚un}챔혵혳채쨍혷챔혴짚un{^ 11000011101010001100001010011100110000101001101011000011101001001100001010111000110000101001111011000011101010001100001010011011110000101010010001110101011011100111110111000011101010001100001010011100110000101001101011000011101001001100001010111000110000101001111011000011101010001100001010011011110000101010010001110101011011100111101101011110 c3a8c29cc29ac3a4c2b8c29ec3a8c29bc2a4756e7dc3a8c29cc29ac3a4c2b8c29ec3a8c29bc2a4756e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)