To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蝴・縲南発莠護サ厭蝴・縲南発莠護サ閲^ 111001011001101010100101111000111000000010010011111011001001010010101101111001001011101010001100111011001011101110001001011111011110010110011010101001011110001110000000100100111110110010010100101011011110010010111010100011001110110010111011100010010111101101011110 e59aa5e38093ec94ade4ba8cecbb897de59aa5e38093ec94ade4ba8cecbb897b5e
EUC-JP 蝴・縲南発莠護サ厭蝴・縲南発莠護サ閲^ 11101001111110101000111010100101111001011110000011000110111011101100100010101111111010001011110010111000111011101000111010111011101100011101111011101001111110101000111010100101111001011110000011000110111011101100100010101111111010001011110010111000111011101000111010111011101100011101110001011110 e9fa8ea5e5e0c6eec8afe8bcb8ee8ebbb1dee9fa8ea5e5e0c6eec8afe8bcb8ee8ebbb1dc5e
UTF-8 蝴・縲南発莠護サ厭蝴・縲南発莠護サ閲^ 11101000100111011011010011101111101111011010010111100111101110001011001011100101100011011001011111100111100110011011101011101000100011101010000011101000101011011011011111101111101111011011101111100101100011101010110111101000100111011011010011101111101111011010010111100111101110001011001011100101100011011001011111100111100110011011101011101000100011101010000011101000101011011011011111101111101111011011101111101001100101101011001001011110 e89db4efbda5e7b8b2e58d97e799bae88ea0e8adb7efbdbbe58eade89db4efbda5e7b8b2e58d97e799bae88ea0e8adb7efbdbbe996b25e
UHC 蝴??南??護?厭蝴??南??護??^ 1111101111011101001111110011111111010001111101010011111100111111111110111101111000111111111001101111010011111011110111010011111100111111110100011111010100111111001111111111101111011110001111110011111101011110 fbdd3f3fd1f53f3ffbde3fe6f4fbdd3f3fd1f53f3ffbde3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)