To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 姪?嫉?琥???N}姪?嫉?琥???N{^ 100101101100001100111111100011101011100100111111111000001110011000111111001111110011111101001110011111011001011011000011001111111000111010111001001111111110000011100110001111110011111100111111010011100111101101011110 96c33f8eb93fe0e63f3f3f4e7d96c33f8eb93fe0e63f3f3f4e7b5e
EUC-JP 姪?嫉?琥?侄?N}姪?嫉?琥?侄?N{^ 11001100110001010011111110111100101110110011111111100000111010000011111110001111101100001111111000111111010011100111110111001100110001010011111110111100101110110011111111100000111010000011111110001111101100001111111000111111010011100111101101011110 ccc53fbcbb3fe0e83f8fb0fe3f4e7dccc53fbcbb3fe0e83f8fb0fe3f4e7b5e
UTF-8 姪섣嫉롔琥렡侄석N}姪섣嫉롔琥렡侄석N{^ 1110010110100111101010101110110010000100101000111110010110101011100010011110101110100001100101001110011110010000101001011110101110100000101000011110010010111110100001001110110010000100100111010100111001111101111001011010011110101010111011001000010010100011111001011010101110001001111010111010000110010100111001111001000010100101111010111010000010100001111001001011111010000100111011001000010010011101010011100111101101011110 e5a7aaec84a3e5ab89eba194e790a5eba0a1e4be84ec849d4e7de5a7aaec84a3e5ab89eba194e790a5eba0a1e4be84ec849d4e7b5e
UHC 姪섣嫉롔琥렡侄석N}姪섣嫉롔琥렡侄석N{^ 11110010111010111011110010110010111100101110110010001110110110001111101111010000100011101011001011110010111010011011110010101110010011100111110111110010111010111011110010110010111100101110110010001110110110001111101111010000100011101011001011110010111010011011110010101110010011100111101101011110 f2ebbcb2f2ec8ed8fbd08eb2f2e9bcae4e7df2ebbcb2f2ec8ed8fbd08eb2f2e9bcae4e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
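
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page; it only illustrates the idea. The function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed equivalents that may differ from this tool's converters for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the behavior visible in the ISO-8859-1 row.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
# These are assumed equivalents, not necessarily what this page uses internally.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


if __name__ == "__main__":
    sample = "N{^"  # the ASCII tail of the input shown in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the ASCII sample "N{^", every charset listed yields the same bytes (4e 7b 5e), which matches the last three bytes of each row in the table. Differences appear only for non-ASCII characters.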