To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????▼?矣??艶b?醫??嚥▲?魏 001111110011111100111111001111111000000110100101001111111110000111100001001111110011111110001001100100001000001010000010001111111110011111001110001111110011111110011010100010111000000110100011001111111110100110110000 3f3f3f3f81a53fe1e13f3f899082823fe7ce3f3f9a8b81a33fe9b0
EUC-JP ????▼?矣??艶b?醫??嚥▲?魏 001111110011111100111111001111111010001010100111001111111110001011100011001111110011111110110001111100001010001111100010001111111110111011010000001111110011111111010011111010111010001010100101001111111111001010110010 3f3f3f3fa2a73fe2e33f3fb1f0a3e23feed03f3fd3eba2a53ff2b2
UTF-8 僚녹뼐璘▼푻矣묒춻艶b뮓醫귣븶嚥▲꺃魏 111011111010011010111011111010111000010110111001111010111011110010010000111011111010011110101111111000101001011010111100111011011001000110111011111001111001111110100011111010111010110010010010111011001011011010111011111010001000100110110110111011111011110110000010111010111010111010010011111010011000011010101011111010101011011110100011111010111011100010110110111001011001101010100101111000101001011010110010111010101011101010000011111010011010110110001111 efa6bbeb85b9ebbc90efa7afe296bced91bbe79fa3ebac92ecb6bbe889b6efbd82ebae93e986abeab7a3ebb8b6e59aa5e296b2eaba83e9ad8f
UHC 僚녹뼐璘▼푻矣묒춻艶b뮓醫귣븶嚥▲꺃魏 1110100011101000101100111110110010010110100110001110110011011110101000011110010110111110100001111110101111111000100100011110110010101101100101111110011011111101101000111110001010010010100111111110110010100010100000101110101110010101100111111110011010111111101000011110001110000011101011001110101011100000 e8e8b3ec9698ecdea1e5be87ebf891ecad97e6fda3e2929feca282eb959fe6bfa1e383aceae0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)