Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????恂??餘??竊??惟????? 00111111001111110011111100111111001111110011111110011100100101100011111100111111111010010101000000111111001111111110001010000110001111110011111110001000110100100011111100111111001111110011111100111111 3f3f3f3f3f3f9c963f3fe9503f3fe2863f3f88d23f3f3f3f3f
EUC-JP ???絪??恂??餘?Ŧ竊??惟????? 0011111100111111001111111000111111010011111011000011111100111111110101111111011000111111001111111111000110110001001111111000111110101001101011111110001111100110001111110011111110110000110101000011111100111111001111110011111100111111 3f3f3f8fd3ec3f3fd7f63f3ff1b13f8fa9afe3e63f3fb0d43f3f3f3f3f
UTF-8 列룸쓷絪든뙴恂⑸젅餘됰Ŧ竊뗰쬁惟곤폋梨덂슖 1110111110100110100111001110101110100011101110001110110010010011101101111110011110110101101010101110101110010011101000001110101110011001101101001110011010000001100000101110001010010001101110001110110010100000100001011110100110100100100110001110101110010000101100001100010110100110111001111010101110001010111010111001011110110000111011001010110010000001111001101000001110011111111010101011001110100100111011011000111110001011111011111010011110100010111010111000110110000010111011001000101010010110 efa69ceba3b8ec93b7e7b5aaeb93a0eb99b4e68182e291b8eca085e9a498eb90b0c5a6e7ab8aeb97b0ecac81e6839feab3a4ed8f8befa7a2eb8d82ec8a96
UHC 列룸쓷絪든뙴恂⑸젅餘됰Ŧ竊뗰쬁惟곤폋梨덂슖 111001101110101010110111111010111001110110010100111011001101111110110101111001111000110010110111111000101110000110101001111010111010000010001000111001101010111010001001111010111010100010101110111011111011110010001011111011111010011010011000111010101110111010110000111011111011110010010110111011001011000110001000111001011001101010100101 e6eab7eb9d94ecdfb5e78cb7e2e1a9eba088e6ae89eba8aeefbc8befa698eaeeb0efbc96ecb188e59aa5

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
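
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the names `CODECS`, `encode_row`, and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, input, binary, hexadecimal.
    for label, (bits, hex_string) in convert("恂").items():
        print(label, "恂", bits, hex_string)
```

Python's built-in codecs may differ in edge cases from the converter behind this page (for example in vendor-specific extensions), so the output should be treated as an approximation for characters outside the common repertoire.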