To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??伎???儒?????誼??兪?? 001111110011111100111111111000101000011000111111001111111000101011101010001111110011111100111111100011101111001000111111001111110011111100111111001111111000101101100010001111110011111110011001011000000011111100111111 3f3f3fe2863f3f8aea3f3f3f8ef23f3f3f3f3f8b623f3f99603f3f
EUC-JP ???竊??伎彛??儒?????誼??兪?? 0011111100111111001111111110001111100110001111110011111110110100111011001000111110111100111110100011111100111111101111001111010000111111001111110011111100111111001111111011010111000011001111110011111111010001110000010011111100111111 3f3f3fe3e63f3fb4ec8fbcfa3f3fbcf43f3f3f3f3fb5c33f3fd1c13f3f
UTF-8 捻뀁뮆竊섇츦伎彛롥쑵儒얠퐶黎앸럽誼녺윢兪곸물 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010000111111011001011100010100110111001001011110010001110111001011011110110011011111010111010000110100101111011001001000110110101111001011000010010010010111011001001011010100000111011011001000010110110111011111010011010001001111011001001010110111000111010111001111110111101111010001010101010111100111010111000010110111010111011001001110010100010111001011000010110101010111010101011001110111000111010111010110010111100 efa6a4eb8081ebae86e7ab8aec8487ecb8a6e4bc8ee5bd9beba1a5ec91b5e58492ec96a0ed90b6efa689ec95b8eb9fbde8aabceb85baec9ca2e585aaeab3b8ebacbc
UHC 捻뀁뮆竊섇츦伎彛롥쑵儒얠퐶黎앸럽誼녺윢兪곸물 1110011011110111101100101110110010010010100101011110111110111100100110001110010110101110100111001101000011101011111011001010110110001110111001011011111010101010111010101110001110111110111011001011110110011111111001101011000110011101111010111011011110110100111010111111111010000110111001111001111110100011111010101110010010000001111011001011100110110000 e6f7b2ec9295efbc98e5ae9cd0ebecad8ee5beaaeae3beecbd9fe6b19debb7b4ebfe86e79fa3eae481ecb9b0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)