What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 種?義?嘲私??猥?萄楢?嘲私??虞? 1000111011101101001111111000101101100000001111111001101001111101100011101000010000111111001111111110000011001110001111111001001110111000100100111110100000111111100110100111110110001110100001000011111100111111100010111111000100111111 8eed3f8b603f9a7d8e843f3fe0ce3f93b893e83f9a7d8e843f3f8bf13f
EUC-JP 種?義?嘲私??猥?萄楢?嘲私??虞? 1011110011101111001111111011010111000001001111111101001111011110101110111110010000111111001111111110000011010000001111111100011010111010110001101110101000111111110100111101111010111011111001000011111100111111101101101111001100111111 bcef3fb5c13fd3debbe43f3fe0d03fc6bac6ea3fd3debbe43f3fb6f33f
UTF-8 種렕義렎嘲私렢렦猥몃萄楢렎嘲私렢렦虞렛 111001111010100010101110111010111010000010010101111001111011111010101001111010111010000010001110111001011001100010110010111001111010011110000001111010111010000010100010111010111010000010100110111001111000110010100101111010111010101010000011111010001001000010000100111001101010010110100010111010111010000010001110111001011001100010110010111001111010011110000001111010111010000010100010111010111010000010100110111010001001100110011110111010111010000010011011 e7a8aeeba095e7bea9eba08ee598b2e7a781eba0a2eba0a6e78ca5ebaa83e89084e6a5a2eba08ee598b2e7a781eba0a2eba0a6e8999eeba09b
UHC 種렕義렎嘲私렢렦猥몃萄楢렎嘲私렢렦虞렛 1111000011111010100011101010101011101011111110011000111010100100111100001011111111011110111001111000111010110011100011101011010111101000111001011011100011101011110101001010110011101010111110011000111010100100111100001011111111011110111001111000111010110011100011101011010111101001111001011011011110111111 f0fa8eaaebf98ea4f0bfdee78eb38eb5e8e5b8ebd4aceaf98ea4f0bfdee78eb38eb5e9e5b7bf

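SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (hexadecimal 3f) in the table above.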
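
The conversion in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical, and the mapping of the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with "?", which matches the 3f bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in encode_table("種").items():
        print(name, bits, hexstr)
```

For the single character 種, this sketch yields e7a8ae for UTF-8, 8eed for SJIS-WIN, bcef for EUC-JP, and 3f for ISO-8859-1, which agree with the first bytes of the corresponding rows above.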