To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?r??nf?r??n^}Y?r??nf?r??n^}bE 0011111101110010001111110011111101101110011001100011111101110010001111110011111101101110010111100111110101011001001111110111001000111111001111110110111001100110001111110111001000111111001111110110111001011110011111010110001001000101 3f723f3f6e663f723f3f6e5e7d593f723f3f6e663f723f3f6e5e7d6245
SJIS-WIN 達r奪竪nf達r奪竪n^}Y達r奪竪nf達r奪竪n^}bE 1001001001000010011100101001001001000100100100100100011101101110011001101001001001000010011100101001001001000100100100100100011101101110010111100111110101011001100100100100001001110010100100100100010010010010010001110110111001100110100100100100001001110010100100100100010010010010010001110110111001011110011111010110001001000101 924272924492476e66924272924492476e5e7d59924272924492476e66924272924492476e5e7d6245
EUC-JP 達r奪竪nf達r奪竪n^}Y達r奪竪nf達r奪竪n^}bE 1100001110100011011100101100001110100101110000111010100001101110011001101100001110100011011100101100001110100101110000111010100001101110010111100111110101011001110000111010001101110010110000111010010111000011101010000110111001100110110000111010001101110010110000111010010111000011101010000110111001011110011111010110001001000101 c3a372c3a5c3a86e66c3a372c3a5c3a86e5e7d59c3a372c3a5c3a86e66c3a372c3a5c3a86e5e7d6245
UTF-8 達r奪竪nf達r奪竪n^}Y達r奪竪nf達r奪竪n^}bE 1110100110000001100101000111001011100101101001011010101011100111101010111010101001101110011001101110100110000001100101000111001011100101101001011010101011100111101010111010101001101110010111100111110101011001111010011000000110010100011100101110010110100101101010101110011110101011101010100110111001100110111010011000000110010100011100101110010110100101101010101110011110101011101010100110111001011110011111010110001001000101 e9819472e5a5aae7abaa6e66e9819472e5a5aae7abaa6e5e7d59e9819472e5a5aae7abaa6e66e9819472e5a5aae7abaa6e5e7d6245
UHC 達r奪竪nf達r奪竪n^}Y達r奪竪nf達r奪竪n^}bE 1101001110111001011100101111011110101100111000101011010101101110011001101101001110111001011100101111011110101100111000101011010101101110010111100111110101011001110100111011100101110010111101111010110011100010101101010110111001100110110100111011100101110010111101111010110011100010101101010110111001011110011111010110001001000101 d3b972f7ace2b56e66d3b972f7ace2b56e5e7d59d3b972f7ace2b56e66d3b972f7ace2b56e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)