To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
EUC-JP ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
UTF-8 連쇘뙱念곭뒙劣며뙑U}連쇘뙱念곭뒙劣며뙑U{^ 1110111110100110100110101110110010000111100110001110101110011001101100011110111110100110101000111110101010110011101011011110101110010010100110011110111110100110100111011110101110101001101100001110101110011001100100010101010101111101111011111010011010011010111011001000011110011000111010111001100110110001111011111010011010100011111010101011001110101101111010111001001010011001111011111010011010011101111010111010100110110000111010111001100110010001010101010111101101011110 efa69aec8798eb99b1efa6a3eab3adeb9299efa69deba9b0eb9991557defa69aec8798eb99b1efa6a3eab3adeb9299efa69deba9b0eb9991557b5e
UHC 連쇘뙱念곭뒙劣며뙑U}連쇘뙱念곭뒙劣며뙑U{^ 1110011011100110101111001110011110001100101101001110011011110110100000011110011110001010100101101110011011101011101110001110011110001100100101100101010101111101111001101110011010111100111001111000110010110100111001101111011010000001111001111000101010010110111001101110101110111000111001111000110010010110010101010111101101011110 e6e6bce78cb4e6f681e78a96e6ebb8e78c96557de6e6bce78cb4e6f681e78a96e6ebb8e78c96557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)