To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 霄エセハ踉シ鱆軸[霄エセハ踉シ鱆軸[^ 11101000101110101011010011110001100011101011111011001010111001101111001010111100111010011110000110001110101100100101101111101000101110101011010011110001100011101011111011001010111001101111001010111100111010011110000110001110101100100101101101011110 e8bab4f18ebecae6f2bce9e18eb25be8bab4f18ebecae6f2bce9e18eb25b5e
EUC-JP 霄エ?セハ踉シ鱆軸[霄エ?セハ踉シ鱆軸[^ 11110000101111001000111010110100001111111000111010111110100011101100101011101100111101001000111010111100111100101110001110111100101101000101101111110000101111001000111010110100001111111000111010111110100011101100101011101100111101001000111010111100111100101110001110111100101101000101101101011110 f0bc8eb43f8ebe8ecaecf48ebcf2e3bcb45bf0bc8eb43f8ebe8ecaecf48ebcf2e3bcb45b5e
UTF-8 霄エセハ踉シ鱆軸[霄エセハ踉シ鱆軸[^ 111010011001110010000100111011111011110110110100111011101000010010001001111011111011110110111110111011111011111010001010111010001011100010001001111011111011110110111100111010011011000110000110111010001011101110111000010110111110100110011100100001001110111110111101101101001110111010000100100010011110111110111101101111101110111110111110100010101110100010111000100010011110111110111101101111001110100110110001100001101110100010111011101110000101101101011110 e99c84efbdb4ee8489efbdbeefbe8ae8b889efbdbce9b186e8bbb85be99c84efbdb4ee8489efbdbeefbe8ae8b889efbdbce9b186e8bbb85b5e
UHC ????????軸[????????軸[^ 0011111100111111001111110011111100111111001111110011111100111111111101011110111001011011001111110011111100111111001111110011111100111111001111110011111111110101111011100101101101011110 3f3f3f3f3f3f3f3ff5ee5b3f3f3f3f3f3f3f3ff5ee5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)