Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??伎?ヨ?惟??茹?????攸?? 00111111001111110011111111100010100001100011111100111111100010101110101000111111100000111000100000111111100010001101001000111111001111111110010010100101001111110011111100111111001111110011111110011101101111110011111100111111 3f3f3fe2863f3f8aea3f83883f88d23f3fe4a53f3f3f3f3f9dbf3f3f
EUC-JP ???竊??伎彛ヨ?惟??茹?????攸?? 001111110011111100111111111000111110011000111111001111111011010011101100100011111011110011111010101001011110100000111111101100001101010000111111001111111110100010100111001111110011111100111111001111110011111111011010110000010011111100111111 3f3f3fe3e63f3fb4ec8fbcfaa5e83fb0d43f3fe8a73f3f3f3f3fdac13f3f
UTF-8 捻뀁뮆竊섇츦伎彛ヨ눧惟곗퓚茹띿슜梨욘뿥攸껎닧 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010000111111011001011100010100110111001001011110010001110111001011011110110011011111000111000001110101000111010111000100010100111111001101000001110011111111010101011001110010111111011011001001110011010111010001000110010111001111010111001110110111111111011001000101010011100111011111010011110100010111011001001101010011000111010111011111110100101111001101001010010111000111010101011101110001110111010111000101110100111 efa6a4eb8081ebae86e7ab8aec8487ecb8a6e4bc8ee5bd9be383a8eb88a7e6839feab397ed939ae88cb9eb9dbfec8a9cefa7a2ec9a98ebbfa5e694b8eabb8eeb8ba7
UHC 捻뀁뮆竊섇츦伎彛ヨ눧惟곗퓚茹띿슜梨욘뿥攸껎닧 1110011011110111101100101110110010010010100101011110111110111100100110001110010110101110100111001101000011101011111011001010110110101011111010001000011110111110111010101110111010110000111011001011111110000101111001101010101010001101111011001001101010101001111011001011000110111111111001101001011110100101111010101111001010000011111011011000100010100011 e6f7b2ec9295efbc98e5ae9cd0ebecadabe887beeaeeb0ecbf85e6aa8dec9aa9ecb1bfe697a5eaf283ed88a3

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
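
A conversion of this kind can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the codec names `cp932` (for SJIS-WIN) and `cp949` (for UHC) are assumed to be the closest Python equivalents, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is why the ISO-8859-1 row above consists entirely of `3f`.

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary string, hex string) for one charset."""
    # Unencodable characters become b"?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    # Render every byte as eight binary digits, with no separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return shown, binary, raw.hex()


def convert(text):
    """Build one table row per charset for the given input text."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, binary, hexa) in convert("あ").items():
        print(label, shown, binary, hexa)
```

For the input `あ`, this sketch produces `e38182` for UTF-8, `82a0` for SJIS-WIN, `a4a2` for EUC-JP, and `3f` for ISO-8859-1, which has no Japanese characters.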