What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 闢厶榊セ楢ス憺囂蟆ア闢厶榊セ楢ス憺囂蟆アB 1110100010010011100110011101000110001101111001011011111010010011111010001011110110011100111010011001101010001111111001011011000010110001111010001001001110011001110100011000110111100101101111101001001111101000101111011001110011101001100110101000111111100101101100001011000101000010 e89399d18de5be93e8bd9ce99a8fe5b0b1e89399d18de5be93e8bd9ce99a8fe5b0b142
EUC-JP 闢厶榊セ楢ス憺囂蟆ア闢厶榊セ楢ス憺囂蟆アB 1110111111110011110100101101001110111010111001111000111010111110110001101110101010001110101111011101100011101011110100111110111111101010101100101000111010110001111011111111001111010010110100111011101011100111100011101011111011000110111010101000111010111101110110001110101111010011111011111110101010110010100011101011000101000010 eff3d2d3bae78ebec6ea8ebdd8ebd3efeab28eb1eff3d2d3bae78ebec6ea8ebdd8ebd3efeab28eb142
UTF-8 闢厶榊セ楢ス憺囂蟆ア闢厶榊セ楢ス憺囂蟆アB 11101001100101111010001011100101100011101011011011100110101001101000101011101111101111011011111011100110101001011010001011101111101111011011110111100110100001101011101011100101100110111000001011101000100111111000011011101111101111011011000111101001100101111010001011100101100011101011011011100110101001101000101011101111101111011011111011100110101001011010001011101111101111011011110111100110100001101011101011100101100110111000001011101000100111111000011011101111101111011011000101000010 e997a2e58eb6e6a68aefbdbee6a5a2efbdbde686bae59b82e89f86efbdb1e997a2e58eb6e6a68aefbdbee6a5a2efbdbde686bae59b82e89f86efbdb142
UHC 闢???楢?憺???闢???楢?憺???B 110111001010001100111111001111110011111111101010111110010011111111010011101111000011111100111111001111111101110010100011001111110011111100111111111010101111100100111111110100111011110000111111001111110011111101000010 dca33f3f3feaf93fd3bc3f3f3fdca33f3f3feaf93fd3bc3f3f3f42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
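
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The function name `encode_table` is hypothetical, and the codec names are Python's own. It assumes that `cp932` corresponds to SJIS-Win and `cp949` to UHC, although the mappings may differ from the tool's for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return (charset, displayed text, binary string, hex string) per charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(name, errors="replace")
        shown = raw.decode(name)  # what survives a round trip through the charset
        binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, shown, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, shown, binary, hexstr in encode_table("B"):
        print(name, shown, binary, hexstr)
```

For the ASCII letter `B`, every charset listed yields the single byte 0x42 (`01000010`), which is why each row of the table ends in `42`.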