Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????W????Jn}????W????Jn{^ 00111111001111110011111100111111010101110011111100111111001111110011111101001010011011100111110100111111001111110011111100111111010101110011111100111111001111110011111101001010011011100111101101011110 3f3f3f3f573f3f3f3f4a6e7d3f3f3f3f573f3f3f3f4a6e7b5e
SJIS-WIN 蝴蝎??W蝴蝎??Jn}蝴蝎??W蝴蝎??Jn{^ 111001011001101011100101100110010011111100111111010101111110010110011010111001011001100100111111001111110100101001101110011111011110010110011010111001011001100100111111001111110101011111100101100110101110010110011001001111110011111101001010011011100111101101011110 e59ae5993f3f57e59ae5993f3f4a6e7de59ae5993f3f57e59ae5993f3f4a6e7b5e
EUC-JP 蝴蝎??W蝴蝎??Jn}蝴蝎??W蝴蝎??Jn{^ 111010011111101011101001111110010011111100111111010101111110100111111010111010011111100100111111001111110100101001101110011111011110100111111010111010011111100100111111001111110101011111101001111110101110100111111001001111110011111101001010011011100111101101011110 e9fae9f93f3f57e9fae9f93f3f4a6e7de9fae9f93f3f57e9fae9f93f3f4a6e7b5e
UTF-8 蝴蝎렲렑W蝴蝎렲렑Jn}蝴蝎렲렑W蝴蝎렲렑Jn{^ 111010001001110110110100111010001001110110001110111010111010000010110010111010111010000010010001010101111110100010011101101101001110100010011101100011101110101110100000101100101110101110100000100100010100101001101110011111011110100010011101101101001110100010011101100011101110101110100000101100101110101110100000100100010101011111101000100111011011010011101000100111011000111011101011101000001011001011101011101000001001000101001010011011100111101101011110 e89db4e89d8eeba0b2eba09157e89db4e89d8eeba0b2eba0914a6e7de89db4e89d8eeba0b2eba09157e89db4e89d8eeba0b2eba0914a6e7b5e
UHC 蝴蝎렲렑W蝴蝎렲렑Jn}蝴蝎렲렑W蝴蝎렲렑Jn{^ 1111101111011101110010101110100110001110101111111000111010100110010101111111101111011101110010101110100110001110101111111000111010100110010010100110111001111101111110111101110111001010111010011000111010111111100011101010011001010111111110111101110111001010111010011000111010111111100011101010011001001010011011100111101101011110 fbddcae98ebf8ea657fbddcae98ebf8ea64a6e7dfbddcae98ebf8ea657fbddcae98ebf8ea64a6e7b5e
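
The conversion in the table above can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: it assumes Python's built-in codecs `cp932` and `cp949` are close stand-ins for SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with `?` (byte `3f`), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Map the table's charset labels to Python codec names.
# Assumption: cp932 ~ SJIS-WIN and cp949 ~ UHC; mappings may differ
# from the original tool for a few vendor-specific characters.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("蝴蝎W").items():
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal one.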

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).