What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 姪?嫉?箏???N}姪?嫉?箏???N{^ 100101101100001100111111100011101011100100111111111000101011010100111111001111110011111101001110011111011001011011000011001111111000111010111001001111111110001010110101001111110011111100111111010011100111101101011110 96c33f8eb93fe2b53f3f3f4e7d96c33f8eb93fe2b53f3f3f4e7b5e
EUC-JP 姪?嫉?箏?侄?N}姪?嫉?箏?侄?N{^ 11001100110001010011111110111100101110110011111111100100101101110011111110001111101100001111111000111111010011100111110111001100110001010011111110111100101110110011111111100100101101110011111110001111101100001111111000111111010011100111101101011110 ccc53fbcbb3fe4b73f8fb0fe3f4e7dccc53fbcbb3fe4b73f8fb0fe3f4e7b5e
UTF-8 姪섣嫉롔箏렡侄석N}姪섣嫉롔箏렡侄석N{^ 1110010110100111101010101110110010000100101000111110010110101011100010011110101110100001100101001110011110101110100011111110101110100000101000011110010010111110100001001110110010000100100111010100111001111101111001011010011110101010111011001000010010100011111001011010101110001001111010111010000110010100111001111010111010001111111010111010000010100001111001001011111010000100111011001000010010011101010011100111101101011110 e5a7aaec84a3e5ab89eba194e7ae8feba0a1e4be84ec849d4e7de5a7aaec84a3e5ab89eba194e7ae8feba0a1e4be84ec849d4e7b5e
UHC 姪섣嫉롔箏렡侄석N}姪섣嫉롔箏렡侄석N{^ 11110010111010111011110010110010111100101110110010001110110110001110111010110100100011101011001011110010111010011011110010101110010011100111110111110010111010111011110010110010111100101110110010001110110110001110111010110100100011101011001011110010111010011011110010101110010011100111101101011110 f2ebbcb2f2ec8ed8eeb48eb2f2e9bcae4e7df2ebbcb2f2ec8ed8eeb48eb2f2e9bcae4e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
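
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about the closest Python equivalents, and the helper name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table, but individual byte values may differ from this page for characters where the codec tables disagree.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("N{^"):
    print(label, bits, hex_string)
```

For a pure-ASCII input such as "N{^", every charset above yields the same bytes, 4e7b5e, which matches the last three bytes of each row in the table.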