What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
EUC-JP ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
UTF-8 묰뫌뫓묮뫔뫇묭뫛몺U}묰뫌뫓묮뫔뫇묭뫛몺U{^ 1110101110101100101100001110101110101011100011001110101110101011100100111110101110101100101011101110101110101011100101001110101110101011100001111110101110101100101011011110101110101011100110111110101110101010101110100101010101111101111010111010110010110000111010111010101110001100111010111010101110010011111010111010110010101110111010111010101110010100111010111010101110000111111010111010110010101101111010111010101110011011111010111010101010111010010101010111101101011110 ebacb0ebab8cebab93ebacaeebab94ebab87ebacadebab9bebaaba557debacb0ebab8cebab93ebacaeebab94ebab87ebacadebab9bebaaba557b5e
UHC 묰뫌뫓묮뫔뫇묭뫛몺U}묰뫌뫓묮뫔뫇묭뫛몺U{^ 1001001001000111100100011010111010010001101101011001001001000101100100011011011010010001101010101001001001000100100100011011101110010001101000000101010101111101100100100100011110010001101011101001000110110101100100100100010110010001101101101001000110101010100100100100010010010001101110111001000110100000010101010111101101011110 924791ae91b5924591b691aa924491bb91a0557d924791ae91b5924591b691aa924491bb91a0557b5e
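
The rows above can be approximated offline with a short Python sketch. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the use of errors="replace" to mimic the "?" (0x3f) bytes shown for characters a charset cannot represent. The names CHARSETS, encode_row and convert are hypothetical helpers, not part of the tool.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent become "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return shown, bits, raw.hex()


def convert(text):
    """Encode text in every charset and return one row per charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in convert("U}").items():
        print(label, shown, bits, hexstr)
```

Calling convert() on a Hangul syllable should give a multi-byte sequence for UTF-8 and UHC and a single 3f byte for the charsets that have no Korean repertoire, which is the pattern visible in the table.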

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).