Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 áÆú×áÆö½j}váÆú×áÆö½j}vB 1110000111000110111110101101011111100001110001101111011010111101011010100111110101110110111000011100011011111010110101111110000111000110111101101011110101101010011111010111011001000010 e1c6fad7e1c6f6bd6a7d76e1c6fad7e1c6f6bd6a7d7642
SJIS-WIN ???×????j}v???×????j}vB 00111111001111110011111110000001011111100011111100111111001111110011111101101010011111010111011000111111001111110011111110000001011111100011111100111111001111110011111101101010011111010111011001000010 3f3f3f817e3f3f3f3f6a7d763f3f3f817e3f3f3f3f6a7d7642
EUC-JP áÆú×áÆö?j}váÆú×áÆö?j}vB 10001111101010111010000110001111101010011010000110001111101010111110001010100001110111111000111110101011101000011000111110101001101000011000111110101011110100110011111101101010011111010111011010001111101010111010000110001111101010011010000110001111101010111110001010100001110111111000111110101011101000011000111110101001101000011000111110101011110100110011111101101010011111010111011001000010 8faba18fa9a18fabe2a1df8faba18fa9a18fabd33f6a7d768faba18fa9a18fabe2a1df8faba18fa9a18fabd33f6a7d7642
UTF-8 áÆú×áÆö½j}váÆú×áÆö½j}vB 110000111010000111000011100001101100001110111010110000111001011111000011101000011100001110000110110000111011011011000010101111010110101001111101011101101100001110100001110000111000011011000011101110101100001110010111110000111010000111000011100001101100001110110110110000101011110101101010011111010111011001000010 c3a1c386c3bac397c3a1c386c3b6c2bd6a7d76c3a1c386c3bac397c3a1c386c3b6c2bd6a7d7642
UHC ?Æ?×?Æ?½j}v?Æ?×?Æ?½j}vB 00111111101010001010000100111111101000011011111100111111101010001010000100111111101010001111011001101010011111010111011000111111101010001010000100111111101000011011111100111111101010001010000100111111101010001111011001101010011111010111011001000010 3fa8a13fa1bf3fa8a13fa8f66a7d763fa8a13fa1bf3fa8a13fa8f66a7d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
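
A conversion like the one in the table can be sketched in Python as follows. This is a minimal illustration, not the converter behind this page: the codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are Python's, and mapping them to the labels above is an assumption. The helper names encode_row and encode_table are hypothetical. Characters a charset cannot represent are replaced with "?" (0x3f), and results for such characters may differ from the table.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (characters as stored, binary bit string, hex string) for one codec."""
    # Unmappable characters become "?" instead of raising UnicodeEncodeError.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what the charset actually preserved.
    shown = raw.decode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in encode_table("áB").items():
        print(label, shown, bits, hexa)
```

For example, encode_row("á", "utf-8") yields the two bytes c3 a1, which matches the start of the UTF-8 row above, while ISO-8859-1 stores the same character as the single byte e1.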