To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????W 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f57
SJIS-WIN 梯?畯脈?耿??趙麥??源?衣私??W 10010010111100100011111111111011011011111001011010101100001111111110001111010100001111110011111111100110111000101110101001101101001111110011111110001100101110010011111110001000110111111000111010000100001111110011111101010111 92f23ffb6f96ac3fe3d43f3fe6e2ea6d3f3f8cb93f88df8e843f3f57
EUC-JP 梯?畯脈?耿??趙麥??源?衣私??W 1100010011110100001111111000111111001101101110111100110010101110001111111110011011010110001111110011111111101100111001001111001111001110001111110011111110111000101110110011111110110000111000011011101111100100001111110011111101010111 c4f43f8fcdbbccae3fe6d63f3fece4f3ce3f3fb8bb3fb0e1bbe43f3f57
UTF-8 梯렟畯脈렮耿렕렟趙麥렧렢源렰衣私렟냠W 11100110101000101010111111101011101000001001111111100111100101011010111111101000100001001000100011101011101000001010111011101000100000001011111111101011101000001001010111101011101000001001111111101000101101101001100111101001101110101010010111101011101000001010011111101011101000001010001011100110101110101001000011101011101000001011000011101000101000011010001111100111101001111000000111101011101000001001111111101011100000111010000001010111 e6a2afeba09fe795afe88488eba0aee880bfeba095eba09fe8b699e9baa5eba0a7eba0a2e6ba90eba0b0e8a1a3e7a781eba09feb83a057
UHC 梯렟畯脈렮耿렕렟趙麥렧렢源렰衣私렟냠W 11110000101011001000111010110000111100011110000111011000111001101000111010111011110011001110101010001110101010101000111010110000111100001110000111011000111010101000111010110110100011101011001111101010101110011000111010111101111010111111110111011110111001111000111010110000101100111100100001010111 f0ac8eb0f1e1d8e68ebbccea8eaa8eb0f0e1d8ea8eb68eb3eab98ebdebfddee78eb0b3c857

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)