What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??擬??壓??肄???λ?筌?? 001111110011111100111111111000101000011000111111001111111000101101011011001111110011111110011010110110000011111100111111111000111110010100111111001111110011111110000011110010010011111111100010101000110011111100111111 3f3f3fe2863f3f8b5b3f3f9ad83f3fe3e53f3f3f83c93fe2a33f3f
EUC-JP ???竊??擬??壓??肄??洹λ?筌?? 0011111100111111001111111110001111100110001111110011111110110101101111000011111100111111110101001101101000111111001111111110011011100111001111110011111110001111110001111011101010100110110010110011111111100100101001010011111100111111 3f3f3fe3e63f3fb5bc3f3fd4da3f3fe6e73f3f8fc7baa6cb3fe4a53f3f
UTF-8 捻뀁뮆竊섉윓擬띿젂壓믩떽肄잏솻洹λ즴筌됰궑 1110111110100110101001001110101110000000100000011110101110101110100001101110011110101011100010101110110010000100100010011110110010011100100100111110011010010011101011001110101110011101101111111110110010100000100000101110010110100011100100111110101110101111101010011110101110010110101111011110100010000010100001001110110010011110100011111110110010000110101110111110011010110100101110011100111010111011111011001010011010110100111001111010110110001100111010111001000010110000111010101011011010010001 efa6a4eb8081ebae86e7ab8aec8489ec9c93e693aceb9dbfeca082e5a393ebafa9eb96bde88284ec9e8fec86bbe6b4b9cebbeca6b4e7ad8ceb90b0eab691
UHC 捻뀁뮆竊섉윓擬띿젂壓믩떽肄잏솻洹λ즴筌됰궑 111001101111011110110010111011001001001010010101111011111011110010011000111001101001111110011010111010111111010010001101111011001010000010000110111001001110001010010010111010111011011010111101111011001011110110011111111001111001100110110000111010101011011110100101111010111010001110000110111011111010011110001001111010111000001010100110 e6f7b2ec9295efbc98e69f9aebf48deca086e4e292ebb6bdecbd9fe799b0eab7a5eba386efa789eb82a6

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
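
The rows of the table above can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are made up for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` shown in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in encode_table("あ").items():
        print(name, binary, hexadecimal)
```

For example, the hiragana "あ" comes out as `e38182` in UTF-8, `82a0` in CP932, and `a4a2` in EUC-JP, while ISO-8859-1 cannot represent it and yields `3f`.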