What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????恂ワ?筌??竊??款?▼? 00111111001111110011111100111111001111110011111110011100100101101000001110001111001111111110001010100011001111110011111111100010100001100011111100111111100010101011110000111111100000011010010100111111 3f3f3f3f3f3f9c96838f3fe2a33f3fe2863f3f8abc3f81a53f
EUC-JP ???彛??恂ワ?筌??竊??款沅▼? 0011111100111111001111111000111110111100111110100011111100111111110101111111011010100101111011110011111111100100101001010011111100111111111000111110011000111111001111111011010010111110100011111100011011101001101000101010011100111111 3f3f3f8fbcfa3f3fd7f6a5ef3fe4a53f3fe3e63f3fb4be8fc6e9a2a73f
UTF-8 列룸씈彛쎿씭恂ワ폋筌뉗뮆竊뗥땔款沅▼뿽 111011111010011010011100111010111010001110111000111011001001010010001000111001011011110110011011111011001000111010111111111011001001010010101101111001101000000110000010111000111000001110101111111011011000111110001011111001111010110110001100111010111000100110010111111010111010111010000110111001111010101110001010111010111001011110100101111010111001010110010100111001101010110010111110111001101011001010000101111000101001011010111100111010111011111110111101 efa69ceba3b8ec9488e5bd9bec8ebfec94ade68182e383afed8f8be7ad8ceb8997ebae86e7ab8aeb97a5eb9594e6acbee6b285e296bcebbfbd
UHC 列룸씈彛쎿씭恂ワ폋筌뉗뮆竊뗥땔款沅▼뿽 1110011011101010101101111110101110011101101000001110110010101101100110111110011010011101101111101110001011100001101010111110111110111100100101101110111110100111100001111110110010010010100101011110111110111100100010111110010110110110101010101100111010110011111010101011011010100001111001011001011110111101 e6eab7eb9da0ecad9be69dbee2e1abefbc96efa787ec9295efbc8be5b6aaceb3eab6a1e597bd

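SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f), as in the ISO-8859-1 row.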
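
The conversion shown in the table can be reproduced roughly with the Python sketch below. It is not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) are assumed equivalents, the helper names `encode_bits` and `charset_table` are hypothetical, and the individual byte mappings may differ from this page's converter for some characters. Unencodable characters are replaced with "?" (3f), which matches the behavior seen in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's closest match
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_bits(text, codec):
    """Return (binary string, hex string) of text encoded with codec.

    Characters the codec cannot represent become '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def charset_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexa) in charset_table("あ").items():
        print(name, binary, hexa)
```

Each byte is printed as eight bits, so the binary column is always four times as long as the hexadecimal column.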