To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????O 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f
SJIS-WIN ???肉??飮?????幼?????若??O 0011111100111111001111111001001111110111001111110011111110011111010110100011111100111111001111110011111100111111100101110110001100111111001111110011111100111111001111111000111011100001001111110011111101001111 3f3f3f93f73f3f9f5a3f3f3f3f3f97633f3f3f3f3f8ee13f3f4f
EUC-JP ???肉??飮?????幼??洧??若??O 00111111001111110011111111000110111110010011111100111111110111011011101100111111001111110011111100111111001111111100110111000100001111110011111110001111110001111011010000111111001111111011110011100011001111110011111101001111 3f3f3fc6f93f3fddbb3f3f3f3f3fcdc43f3f8fc7b43f3fbce33f3f4f
UTF-8 列룸씈肉꾢렘飮뉛폋濾낅슓幼뚨뙴洧좎띅若뽯쫨O 11101111101001101001110011101011101000111011100011101100100101001000100011101000100000101000100111101010101111101010001011101011101000001001100011101001101000111010111011101011100010011001101111101101100011111000101111101111101001101000010011101011100000101000010111101100100010101001001111100101101110011011110011101011100110101010100011101011100110011011010011100110101101001010011111101100101000101000111011101011100111011000010111101000100010111010010111101011101111011010111111101100101010111010100001001111 efa69ceba3b8ec9488e88289eabea2eba098e9a3aeeb899bed8f8befa684eb8285ec8a93e5b9bceb9aa8eb99b4e6b4a7eca28eeb9d85e88ba5ebbdafecaba84f
UHC 列룸씈肉꾢렘飮뉛폋濾낅슓幼뚨뙴洧좎띅若뽯쫨O 11100110111010101011011111101011100111011010000011101011101111111000010011100101101101111011110111101011111001101000011111101111101111001001011011100110101001001000010111101011100110101010001011101010111010101000110011100111100011001011011111101010111110111010000011101100100011011011111111100101101101001001011011101011101001101000000101001111 e6eab7eb9da0ebbf84e5b7bdebe687efbc96e6a485eb9aa2eaea8ce78cb7eafba0ec8dbfe5b496eba6814f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)