What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 縡?鬱豆?梯?畯脈?拒縡?鬱豆?梯?畯脈?居^ 11100011011100010011111110011111010101001001001110100100001111111001001011110010001111111111101101101111100101101010110000111111100010111001000111100011011100010011111110011111010101001001001110100100001111111001001011110010001111111111101101101111100101101010110000111111100010111000111101011110 e3713f9f5493a43f92f23ffb6f96ac3f8b91e3713f9f5493a43f92f23ffb6f96ac3f8b8f5e
EUC-JP 縡?鬱豆?梯?畯脈?拒縡?鬱豆?梯?畯脈?居^ 111001011101001000111111110111011011010111000110101001100011111111000100111101000011111110001111110011011011101111001100101011100011111110110101111100011110010111010010001111111101110110110101110001101010011000111111110001001111010000111111100011111100110110111011110011001010111000111111101101011110111101011110 e5d23fddb5c6a63fc4f43f8fcdbbccae3fb5f1e5d23fddb5c6a63fc4f43f8fcdbbccae3fb5ef5e
UTF-8 縡렕鬱豆뱌梯렟畯脈렮拒縡렕鬱豆뱌梯렟畯脈렮居^ 11100111101110001010000111101011101000001001010111101001101011001011000111101000101100011000011011101011101100011000110011100110101000101010111111101011101000001001111111100111100101011010111111101000100001001000100011101011101000001010111011100110100010111001001011100111101110001010000111101011101000001001010111101001101011001011000111101000101100011000011011101011101100011000110011100110101000101010111111101011101000001001111111100111100101011010111111101000100001001000100011101011101000001010111011100101101100011000010101011110 e7b8a1eba095e9acb1e8b186ebb18ce6a2afeba09fe795afe88488eba0aee68b92e7b8a1eba095e9acb1e8b186ebb18ce6a2afeba09fe795afe88488eba0aee5b1855e
UHC 縡렕鬱豆뱌梯렟畯脈렮拒縡렕鬱豆뱌梯렟畯脈렮居^ 111011101010110110001110101010101110101010100110110101001110011110111001111100101111000010101100100011101011000011110001111000011101100011100110100011101011101111001011110111101110111010101101100011101010101011101010101001101101010011100111101110011111001011110000101011001000111010110000111100011110000111011000111001101000111010111011110010111101110001011110 eead8eaaeaa6d4e7b9f2f0ac8eb0f1e1d8e68ebbcbdeeead8eaaeaa6d4e7b9f2f0ac8eb0f1e1d8e68ebbcbdc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
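
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation. It assumes Python's built-in codecs `cp932`, `euc_jp` and `cp949` correspond to SJIS-WIN, EUC-JP and UHC, and that unencodable characters are replaced with "?" (byte 3f), which is what the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row` and `conversion_table` are hypothetical.

```python
# Map the table's charset labels to Python codec names.
# Assumption: these codecs match the charsets the tool uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent; this is an assumption about the tool's behavior.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def conversion_table(text):
    """Encode text in every charset and collect the results by label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in conversion_table("居^").items():
        print(label, bits, hex_string)
```

For the input "居^", the UTF-8 row of this sketch yields the hexadecimal string e5b1855e, which matches the last four bytes of the UTF-8 row in the table above.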