What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 蠍ク讚毳v蠍ク讚毳vB 111010001010000010001101111011111011110110111000111010001010111010011010111001101010111110110011011101101110100010100000100011011110111110111101101110001110100010101110100110101110011010101111101100110111011001000010 e8a08defbdb8e8ae9ae6afb376e8a08defbdb8e8ae9ae6afb37642
SJIS-WIN ????????????v????????????vB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f3f3f7642
EUC-JP è??ï?¸è®?æ¯?vè??ï?¸è®?æ¯?vB 10001111101010111011001000111111001111111000111110101011110000010011111110001111101000101011000110001111101010111011001010001111101000101110111000111111100011111010100111000001100011111010001010110100001111110111011010001111101010111011001000111111001111111000111110101011110000010011111110001111101000101011000110001111101010111011001010001111101000101110111000111111100011111010100111000001100011111010001010110100001111110111011001000010 8fabb23f3f8fabc13f8fa2b18fabb28fa2ee3f8fa9c18fa2b43f768fabb23f3f8fabc13f8fa2b18fabb28fa2ee3f8fa9c18fa2b43f7642
UTF-8 蠍ク讚毳v蠍ク讚毳vB 110000111010100011000010101000001100001010001101110000111010111111000010101111011100001010111000110000111010100011000010101011101100001010011010110000111010011011000010101011111100001010110011011101101100001110101000110000101010000011000010100011011100001110101111110000101011110111000010101110001100001110101000110000101010111011000010100110101100001110100110110000101010111111000010101100110111011001000010 c3a8c2a0c28dc3afc2bdc2b8c3a8c2aec29ac3a6c2afc2b376c3a8c2a0c28dc3afc2bdc2b8c3a8c2aec29ac3a6c2afc2b37642
UHC ????½¸?®?æ?³v????½¸?®?æ?³vB 00111111001111110011111100111111101010001111011010100010101011000011111110100010111001110011111110101001101000010011111110101001111110000111011000111111001111110011111100111111101010001111011010100010101011000011111110100010111001110011111110101001101000010011111110101001111110000111011001000010 3f3f3f3fa8f6a2ac3fa2e73fa9a13fa9f8763f3f3f3fa8f6a2ac3fa2e73fa9a13fa9f87642

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
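
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name to_bit_strings and the mapping from the table's labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the runs of 3f in the table above.

```python
def to_bit_strings(text, charset, errors="replace"):
    """Encode text in the given charset; return (binary string, hex string)."""
    # With errors="replace", unencodable characters become b"?" (0x3f).
    data = text.encode(charset, errors=errors)
    # Render every byte as eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "あvB"  # hypothetical input; any short string works
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For example, to_bit_strings("vB", "utf-8") returns ("0111011001000010", "7642"), matching the trailing 7642 seen in every row of the table.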