To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????^????}v????^????}vB 0011111100111111001111110011111101011110001111110011111100111111001111110111110101110110001111110011111100111111001111110101111000111111001111110011111100111111011111010111011001000010 3f3f3f3f5e3f3f3f3f7d763f3f3f3f5e3f3f3f3f7d7642
SJIS-WIN 辣ョ螳欷^辣ョ螳歃}v辣ョ螳欷^辣ョ螳歃}vB 1110011110000101101011101110010110101110100111110101011101011110111001111000010110101110111001011010111010011111010111000111110101110110111001111000010110101110111001011010111010011111010101110101111011100111100001011010111011100101101011101001111101011100011111010111011001000010 e785aee5ae9f575ee785aee5ae9f5c7d76e785aee5ae9f575ee785aee5ae9f5c7d7642
EUC-JP 辣ョ螳欷^辣ョ螳歃}v辣ョ螳欷^辣ョ螳歃}vB 111011011110010110001110101011101110101010110000110111011011100001011110111011011110010110001110101011101110101010110000110111011011110101111101011101101110110111100101100011101010111011101010101100001101110110111000010111101110110111100101100011101010111011101010101100001101110110111101011111010111011001000010 ede58eaeeab0ddb85eede58eaeeab0ddbd7d76ede58eaeeab0ddb85eede58eaeeab0ddbd7d7642
UTF-8 辣ョ螳欷^辣ョ螳歃}v辣ョ螳欷^辣ョ螳歃}vB 11101000101111101010001111101111101111011010111011101000100111101011001111100110101011001011011101011110111010001011111010100011111011111011110110101110111010001001111010110011111001101010110110000011011111010111011011101000101111101010001111101111101111011010111011101000100111101011001111100110101011001011011101011110111010001011111010100011111011111011110110101110111010001001111010110011111001101010110110000011011111010111011001000010 e8bea3efbdaee89eb3e6acb75ee8bea3efbdaee89eb3e6ad837d76e8bea3efbdaee89eb3e6acb75ee8bea3efbdaee89eb3e6ad837d7642
UHC 辣?螳?^辣?螳?}v辣?螳?^辣?螳?}vB 11010101101110000011111111010011110110010011111101011110110101011011100000111111110100111101100100111111011111010111011011010101101110000011111111010011110110010011111101011110110101011011100000111111110100111101100100111111011111010111011001000010 d5b83fd3d93f5ed5b83fd3d93f7d76d5b83fd3d93f5ed5b83fd3d93f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)