To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???^h???^fN}???^h???^fN{^ 00111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111110100111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111101101011110 3f3f3f5e683f3f3f5e664e7d3f3f3f5e683f3f3f5e664e7b5e
SJIS-WIN 雎舌そ^h雎舌そ^fN}雎舌そ^h雎舌そ^fN{^ 11101000101100011001000011100011100000101011101101011110011010001110100010110001100100001110001110000010101110110101111001100110010011100111110111101000101100011001000011100011100000101011101101011110011010001110100010110001100100001110001110000010101110110101111001100110010011100111101101011110 e8b190e382bb5e68e8b190e382bb5e664e7de8b190e382bb5e68e8b190e382bb5e664e7b5e
EUC-JP 雎舌そ^h雎舌そ^fN}雎舌そ^h雎舌そ^fN{^ 11110000101100111100000011100101101001001011110101011110011010001111000010110011110000001110010110100100101111010101111001100110010011100111110111110000101100111100000011100101101001001011110101011110011010001111000010110011110000001110010110100100101111010101111001100110010011100111101101011110 f0b3c0e5a4bd5e68f0b3c0e5a4bd5e664e7df0b3c0e5a4bd5e68f0b3c0e5a4bd5e664e7b5e
UTF-8 雎舌そ^h雎舌そ^fN}雎舌そ^h雎舌そ^fN{^ 11101001100110111000111011101000100010001000110011100011100000011001110101011110011010001110100110011011100011101110100010001000100011001110001110000001100111010101111001100110010011100111110111101001100110111000111011101000100010001000110011100011100000011001110101011110011010001110100110011011100011101110100010001000100011001110001110000001100111010101111001100110010011100111101101011110 e99b8ee8888ce3819d5e68e99b8ee8888ce3819d5e664e7de99b8ee8888ce3819d5e68e99b8ee8888ce3819d5e664e7b5e
UHC 雎舌そ^h雎舌そ^fN}雎舌そ^h雎舌そ^fN{^ 11101110110100011110000011011111101010101011110101011110011010001110111011010001111000001101111110101010101111010101111001100110010011100111110111101110110100011110000011011111101010101011110101011110011010001110111011010001111000001101111110101010101111010101111001100110010011100111101101011110 eed1e0dfaabd5e68eed1e0dfaabd5e664e7deed1e0dfaabd5e68eed1e0dfaabd5e664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)