To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 贈???垣?第?憶烽?蔚?垣?第?憶六 1001000110100001001111110011111100111111100010100101111100111111100100011110011000111111100010011010111111100000100000100011111110001001010101010011111110001010010111110011111110010001111001100011111110001001101011111001100001011010 91a13f3f3f8a5f3f91e63f89afe0823f89553f8a5f3f91e63f89af985a
EUC-JP 贈???垣?第?憶烽潾蔚?垣?第?憶六 11000010101000110011111100111111001111111011001111000000001111111100001011101000001111111011001010110001110111111110001010001111110010001110001010110001101101100011111110110011110000000011111111000010111010000011111110110010101100011100111110111011 c2a33f3f3fb3c03fc2e83fb2b1dfe28fc8e2b1b63fb3c03fc2e83fb2b1cfbb
UTF-8 贈쭹렍렯垣렖第렞憶烽潾蔚렯垣렖第렞憶六 111010001011010010001000111011001010110110111001111010111010000010001101111010111010000010101111111001011001111010100011111010111010000010010110111001111010110010101100111010111010000010011110111001101000011010110110111001111000001110111101111001101011110110111110111010001001010010011010111010111010000010101111111001011001111010100011111010111010000010010110111001111010110010101100111010111010000010011110111001101000011010110110111001011000010110101101 e8b488ecadb9eba08deba0afe59ea3eba096e7acaceba09ee686b6e783bde6bdbee8949aeba0afe59ea3eba096e7acaceba09ee686b6e585ad
UHC 贈쭹렍렯垣렖第렞憶烽潾蔚렯垣렖第렞憶六 1111000111111100110000101110011110001110101000111000111010111100111010101010111110001110101010111111000010101111100011101010111111100101111000111101110011101011110101111111000111101010101001011000111010111100111010101010111110001110101010111111000010101111100011101010111111100101111000111101011110111111 f1fcc2e78ea38ebceaaf8eabf0af8eafe5e3dcebd7f1eaa58ebceaaf8eabf0af8eafe5e3d7bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)