To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 紙?肢?贈????烽??肢?贈????六 100011101000011000111111100011101000100000111111100100011010000100111111001111110011111100111111111000001000001000111111001111111000111010001000001111111001000110100001001111110011111100111111001111111001100001011010 8e863f8e883f91a13f3f3f3fe0823f3f8e883f91a13f3f3f3f985a
EUC-JP 紙?肢?贈????烽??肢?贈????六 101110111110011000111111101110111110100000111111110000101010001100111111001111110011111100111111110111111110001000111111001111111011101111101000001111111100001010100011001111110011111100111111001111111100111110111011 bbe63fbbe83fc2a33f3f3f3fdfe23f3fbbe83fc2a33f3f3f3fcfbb
UTF-8 紙렗肢렖贈얹렱폈렱烽렢렗肢렖贈얹렱폈렱六 111001111011010010011001111010111010000010010111111010001000001010100010111010111010000010010110111010001011010010001000111011001001011010111001111010111010000010110001111011011000111110001000111010111010000010110001111001111000001110111101111010111010000010100010111010111010000010010111111010001000001010100010111010111010000010010110111010001011010010001000111011001001011010111001111010111010000010110001111011011000111110001000111010111010000010110001111001011000010110101101 e7b499eba097e882a2eba096e8b488ec96b9eba0b1ed8f88eba0b1e783bdeba0a2eba097e882a2eba096e8b488ec96b9eba0b1ed8f88eba0b1e585ad
UHC 紙렗肢렖贈얹렱폈렱烽렢렗肢렖贈얹렱폈렱六 11110010101101011000111010101100111100101011011010001110101010111111000111111100101111101111000110001110101111101100011011110001100011101011111011011100111010111000111010110011100011101010110011110010101101101000111010101011111100011111110010111110111100011000111010111110110001101111000110001110101111101101011110111111 f2b58eacf2b68eabf1fcbef18ebec6f18ebedceb8eb38eacf2b68eabf1fcbef18ebec6f18ebed7bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)