To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | æ±vWznfæ±vWzn^}Yæ±vWznfæ±vWzn^}bE | 111001101011000101110110010101110111101001101110011001101110011010110001011101100101011101111010011011100101111001111101010110011110011010110001011101100101011101111010011011100110011011100110101100010111011001010111011110100110111001011110011111010110001001000101 | e6b176577a6e66e6b176577a6e5e7d59e6b176577a6e66e6b176577a6e5e7d6245
SJIS-WIN | ?±vWznf?±vWzn^}Y?±vWznf?±vWzn^}bE | 00111111100000010111110101110110010101110111101001101110011001100011111110000001011111010111011001010111011110100110111001011110011111010101100100111111100000010111110101110110010101110111101001101110011001100011111110000001011111010111011001010111011110100110111001011110011111010110001001000101 | 3f817d76577a6e663f817d76577a6e5e7d593f817d76577a6e663f817d76577a6e5e7d6245
EUC-JP | æ±vWznfæ±vWzn^}Yæ±vWznfæ±vWzn^}bE | 100011111010100111000001101000011101111001110110010101110111101001101110011001101000111110101001110000011010000111011110011101100101011101111010011011100101111001111101010110011000111110101001110000011010000111011110011101100101011101111010011011100110011010001111101010011100000110100001110111100111011001010111011110100110111001011110011111010110001001000101 | 8fa9c1a1de76577a6e668fa9c1a1de76577a6e5e7d598fa9c1a1de76577a6e668fa9c1a1de76577a6e5e7d6245
UTF-8 | æ±vWznfæ±vWzn^}Yæ±vWznfæ±vWzn^}bE | 1100001110100110110000101011000101110110010101110111101001101110011001101100001110100110110000101011000101110110010101110111101001101110010111100111110101011001110000111010011011000010101100010111011001010111011110100110111001100110110000111010011011000010101100010111011001010111011110100110111001011110011111010110001001000101 | c3a6c2b176577a6e66c3a6c2b176577a6e5e7d59c3a6c2b176577a6e66c3a6c2b176577a6e5e7d6245
UHC | æ±vWznfæ±vWzn^}Yæ±vWznfæ±vWzn^}bE | 1010100110100001101000011011111001110110010101110111101001101110011001101010100110100001101000011011111001110110010101110111101001101110010111100111110101011001101010011010000110100001101111100111011001010111011110100110111001100110101010011010000110100001101111100111011001010111011110100110111001011110011111010110001001000101 | a9a1a1be76577a6e66a9a1a1be76577a6e5e7d59a9a1a1be76577a6e66a9a1a1be76577a6e5e7d6245

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
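
The conversion in the table can be approximated offline with a short Python sketch. The codec names below (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions about which Python codecs correspond to the charsets listed here, and `encode_row` is a hypothetical helper, not part of this tool. Characters that a charset cannot represent are replaced with "?", as in the SJIS-WIN row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (hex string, bit string) for text encoded with the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    hex_string = data.hex()
    # Each byte becomes exactly eight binary digits, zero-padded on the left.
    bit_string = "".join(f"{byte:08b}" for byte in data)
    return hex_string, bit_string


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        hex_string, bit_string = encode_row("æ±", codec)
        print(label, bit_string, hex_string)
```

For the two-character input "æ±", the UTF-8 result is `c3a6c2b1`, which matches the first four bytes of the UTF-8 row above. Python's codec tables may differ from this tool's for some characters, so treat the output as a cross-check rather than a reference.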