To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 縡?醍??矜??啼?仲?縡?醍??矜??啼?仲?^ 1110001101110001001111111001000111100111001111110011111111100001111000000011111100111111100110100110010100111111100100101000011100111111111000110111000100111111100100011110011100111111001111111110000111100000001111110011111110011010011001010011111110010010100001110011111101011110 e3713f91e73f3fe1e03f3f9a653f92873fe3713f91e73f3fe1e03f3f9a653f92873f5e
EUC-JP 縡?醍??矜??啼?仲?縡?醍??矜??啼?仲?^ 1110010111010010001111111100001011101001001111110011111111100010111000100011111100111111110100111100011000111111110000111110011100111111111001011101001000111111110000101110100100111111001111111110001011100010001111110011111111010011110001100011111111000011111001110011111101011110 e5d23fc2e93f3fe2e23f3fd3c63fc3e73fe5d23fc2e93f3fe2e23f3fd3c63fc3e73f5e
UTF-8 縡렕醍당렚矜렣렔啼렮仲줏縡렕醍당렚矜렣렔啼렮仲줌^ 11100111101110001010000111101011101000001001010111101001100001101000110111101011100010111011100111101011101000001001101011100111100111111001110011101011101000001010001111101011101000001001010011100101100101011011110011101011101000001010111011100100101110111011001011101100101001001000111111100111101110001010000111101011101000001001010111101001100001101000110111101011100010111011100111101011101000001001101011100111100111111001110011101011101000001010001111101011101000001001010011100101100101011011110011101011101000001010111011100100101110111011001011101100101001001000110001011110 e7b8a1eba095e9868deb8bb9eba09ae79f9ceba0a3eba094e595bceba0aee4bbb2eca48fe7b8a1eba095e9868deb8bb9eba09ae79f9ceba0a3eba094e595bceba0aee4bbb2eca48c5e
UHC 縡렕醍당렚矜렣렔啼렮仲줏縡렕醍당렚矜렣렔啼렮仲줌^ 11101110101011011000111010101010111100001011010110110100111001111000111010101101110100001110100010001110101101001000111010101001111100001010011010001110101110111111000111101010110000011101111011101110101011011000111010101010111100001011010110110100111001111000111010101101110100001110100010001110101101001000111010101001111100001010011010001110101110111111000111101010110000011101110001011110 eead8eaaf0b5b4e78eadd0e88eb48ea9f0a68ebbf1eac1deeead8eaaf0b5b4e78eadd0e88eb48ea9f0a68ebbf1eac1dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)