To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????N}??????????N{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111010011100111110100111111001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 凹??曖?凹??曖?N}凹??曖?凹??曖?N{^ 100010011001101000111111001111111001111001000010001111111000100110011010001111110011111110011110010000100011111101001110011111011000100110011010001111110011111110011110010000100011111110001001100110100011111100111111100111100100001000111111010011100111101101011110 899a3f3f9e423f899a3f3f9e423f4e7d899a3f3f9e423f899a3f3f9e423f4e7b5e
EUC-JP 凹??曖?凹??曖?N}凹??曖?凹??曖?N{^ 101100011111101000111111001111111101101110100011001111111011000111111010001111110011111111011011101000110011111101001110011111011011000111111010001111110011111111011011101000110011111110110001111110100011111100111111110110111010001100111111010011100111101101011110 b1fa3f3fdba33fb1fa3f3fdba33f4e7db1fa3f3fdba33fb1fa3f3fdba33f4e7b5e
UTF-8 凹든킍曖냜凹든킍曖냚N}凹든킍曖냜凹든킍曖냚N{^ 1110010110000111101110011110101110010011101000001110110110000010100011011110011010011011100101101110101110000011100111001110010110000111101110011110101110010011101000001110110110000010100011011110011010011011100101101110101110000011100110100100111001111101111001011000011110111001111010111001001110100000111011011000001010001101111001101001101110010110111010111000001110011100111001011000011110111001111010111001001110100000111011011000001010001101111001101001101110010110111010111000001110011010010011100111101101011110 e587b9eb93a0ed828de69b96eb839ce587b9eb93a0ed828de69b96eb839a4e7de587b9eb93a0ed828de69b96eb839ce587b9eb93a0ed828de69b96eb839a4e7b5e
UHC 凹든킍曖냜凹든킍曖냚N}凹든킍曖냜凹든킍曖냚N{^ 111010001110101010110101111001111011010010011001111001001111001010000110011010001110100011101010101101011110011110110100100110011110010011110010100001100110011001001110011111011110100011101010101101011110011110110100100110011110010011110010100001100110100011101000111010101011010111100111101101001001100111100100111100101000011001100110010011100111101101011110 e8eab5e7b499e4f28668e8eab5e7b499e4f286664e7de8eab5e7b499e4f28668e8eab5e7b499e4f286664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)