Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 址???祭???????祭??? 10011010101011000011111100111111001111111000110111010101001111110011111100111111001111110011111100111111001111111000110111010101001111110011111100111111 9aac3f3f3f8dd53f3f3f3f3f3f3f8dd53f3f3f
EUC-JP 址???祭???????祭??? 11010100101011100011111100111111001111111011101011010111001111110011111100111111001111110011111100111111001111111011101011010111001111110011111100111111 d4ae3f3f3fbad73f3f3f3f3f3f3fbad73f3f3f
UTF-8 址얹렰렧祭잴렲쇰렊얹렰렧祭잼렧슴 111001011001110110000000111011001001011010111001111010111010000010110000111010111010000010100111111001111010010110101101111011001001111010110100111010111010000010110010111011001000011110110000111010111010000010001010111011001001011010111001111010111010000010110000111010111010000010100111111001111010010110101101111011001001111010111100111010111010000010100111111011001000101010110100 e59d80ec96b9eba0b0eba0a7e7a5adec9eb4eba0b2ec87b0eba08aec96b9eba0b0eba0a7e7a5adec9ebceba0a7ec8ab4
UHC 址얹렰렧祭잴렲쇰렊얹렰렧祭잼렧슴 1111001010100011101111101111000110001110101111011000111010110110111100001010111011000000111010101000111010111111101111001110101110001110101000011011111011110001100011101011110110001110101101101111000010101110110000001110101110001110101101101011110110111111 f2a3bef18ebd8eb6f0aec0ea8ebfbceb8ea1bef18ebd8eb6f0aec0eb8eb6bdbf

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP), respectively.
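
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The codec names (`iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed Python equivalents of the charset labels above, and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("址"):
    print(label, bits, hex_string)
```

Individual mappings may differ slightly between implementations of the same legacy charset, so results for rare characters are not guaranteed to match the page byte for byte.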