To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?d??nf?d??n^}Y?d??nf?d??n^}bE | 0011111101100100001111110011111101101110011001100011111101100100001111110011111101101110010111100111110101011001001111110110010000111111001111110110111001100110001111110110010000111111001111110110111001011110011111010110001001000101 | 3f643f3f6e663f643f3f6e5e7d593f643f3f6e663f643f3f6e5e7d6245
SJIS-WIN | 達d叩炭nf達d叩炭n^}Y達d叩炭nf達d叩炭n^}bE | 1001001001000010011001001001001001000000100100100101100101101110011001101001001001000010011001001001001001000000100100100101100101101110010111100111110101011001100100100100001001100100100100100100000010010010010110010110111001100110100100100100001001100100100100100100000010010010010110010110111001011110011111010110001001000101 | 924264924092596e66924264924092596e5e7d59924264924092596e66924264924092596e5e7d6245
EUC-JP | 達d叩炭nf達d叩炭n^}Y達d叩炭nf達d叩炭n^}bE | 1100001110100011011001001100001110100001110000111011101001101110011001101100001110100011011001001100001110100001110000111011101001101110010111100111110101011001110000111010001101100100110000111010000111000011101110100110111001100110110000111010001101100100110000111010000111000011101110100110111001011110011111010110001001000101 | c3a364c3a1c3ba6e66c3a364c3a1c3ba6e5e7d59c3a364c3a1c3ba6e66c3a364c3a1c3ba6e5e7d6245
UTF-8 | 達d叩炭nf達d叩炭n^}Y達d叩炭nf達d叩炭n^}bE | 1110100110000001100101000110010011100101100011111010100111100111100000101010110101101110011001101110100110000001100101000110010011100101100011111010100111100111100000101010110101101110010111100111110101011001111010011000000110010100011001001110010110001111101010011110011110000010101011010110111001100110111010011000000110010100011001001110010110001111101010011110011110000010101011010110111001011110011111010110001001000101 | e9819464e58fa9e782ad6e66e9819464e58fa9e782ad6e5e7d59e9819464e58fa9e782ad6e66e9819464e58fa9e782ad6e5e7d6245
UHC | 達d叩炭nf達d叩炭n^}Y達d叩炭nf達d叩炭n^}bE | 1101001110111001011001001100110110110000111101111010100101101110011001101101001110111001011001001100110110110000111101111010100101101110010111100111110101011001110100111011100101100100110011011011000011110111101010010110111001100110110100111011100101100100110011011011000011110111101010010110111001011110011111010110001001000101 | d3b964cdb0f7a96e66d3b964cdb0f7a96e5e7d59d3b964cdb0f7a96e66d3b964cdb0f7a96e5e7d6245
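
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the charset labels ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical and exist only for this illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the ISO-8859-1 row appears to behave.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset for a short sample taken from the input.
    for label, (bits, hexstr) in encode_table("達d").items():
        print(label, bits, hexstr)
```

For example, `encode_row("達d", "utf-8")` yields the hexadecimal string `e9819464`, which matches the first four bytes of the UTF-8 row.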

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).