To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????恂??孃も?竊?????沃??幼 00111111001111110011111100111111001111110011111110011100100101100011111100111111100110110110111110000010111000000011111111100010100001100011111100111111001111110011111100111111100101111000000000111111001111111001011101100011 3f3f3f3f3f3f9c963f3f9b6f82e03fe2863f3f3f3f3f97803f3f9763
EUC-JP ???絪??恂??孃も?竊?????沃??幼 001111110011111100111111100011111101001111101100001111110011111111010111111101100011111100111111110101011101000010100100111000100011111111100011111001100011111100111111001111110011111100111111110011011110000000111111001111111100110111000100 3f3f3f8fd3ec3f3fd7f63f3fd5d0a4e23fe3e63f3f3f3f3fcde03f3fcdc4
UTF-8 列룸쓷絪든뙴恂㏓렢孃も뫖竊뗤퓠琉울폇沃쇰뙼幼 111011111010011010011100111010111010001110111000111011001001001110110111111001111011010110101010111010111001001110100000111010111001100110110100111001101000000110000010111000111000111110010011111010111010000010100010111001011010110110000011111000111000001010000010111010111010101110010110111001111010101110001010111010111001011110100100111011011001001110100000111011111010011110001100111011001001101010111000111011011000111110000111111001101011001010000011111011001000011110110000111010111001100110111100111001011011100110111100 efa69ceba3b8ec93b7e7b5aaeb93a0eb99b4e68182e38f93eba0a2e5ad83e38282ebab96e7ab8aeb97a4ed93a0efa78cec9ab8ed8f87e6b283ec87b0eb99bce5b9bc
UHC 列룸쓷絪든뙴恂㏓렢孃も뫖竊뗤퓠琉울폇沃쇰뙼幼 1110011011101010101101111110101110011101100101001110110011011111101101011110011110001100101101111110001011100001101001111110101110001110101100111110010110111110101010101110001010010001101110001110111110111100100010111110010010111111100010011110101110100100101111111110111110111100100101001110100010101010101111001110101110001100101111111110101011101010 e6eab7eb9d94ecdfb5e78cb7e2e1a7eb8eb3e5beaae291b8efbc8be4bf89eba4bfefbc94e8aabceb8cbfeaea

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)