What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣ゆ??ъ?嚴щ?寃??垂??日 001111110011111100111111100010111000001110000010111001000011111100111111100001001000110000111111100110101000111010000100100010110011111110011011100000110011111100111111100100001000001000111111001111111001001111111010 3f3f3f8b8382e43f3f848c3f9a8e848b3f9b833f3f90823f3f93fa
EUC-JP ???泣ゆ?蓀ъ?嚴щ?寃??垂??日 0011111100111111001111111011010111100011101001001110011000111111100011111101100011111000101001111110110000111111110100111110111010100111111010110011111111010101111000110011111100111111101111111110001000111111001111111100011011111100 3f3f3fb5e3a4e63f8fd8f8a7ec3fd3eea7eb3fd5e33f3fbfe23f3fc6fc
UTF-8 捻꿔꺂泣ゆ벚蓀ъ쐯嚴щㅏ寃긷슖垂귥눦日 11101111101001101010010011101010101111111001010011101010101110101000001011100110101100111010001111100011100000101000011011101011101100101001101011101000100100111000000011010001100010101110110010010000101011111110010110011010101101001101000110001001111000111000010110001111111001011010111110000011111010101011100010110111111011001000101010010110111001011001111010000010111010101011011110100101111010111000100010100110111001101001011110100101 efa6a4eabf94eaba82e6b3a3e38286ebb29ae89380d18aec90afe59ab4d189e3858fe5af83eab8b7ec8a96e59e82eab7a5eb88a6e697a5
UHC 捻꿔꺂泣ゆ벚蓀ъ쐯嚴щㅏ寃긷슖垂귥눦日 1110011011110111101100101110001110000011101010111110101111101000101010101110011010111010101000101110000111100000101011001110110010011100100100111110010111110001101011001110101110100100101111111110101010110010101100011110010110011010101001011110000111110111100000101110110010000111101111011110110011101101 e6f7b2e383abebe8aae6baa2e1e0acec9c93e5f1aceba4bfeab2b1e59aa5e1f782ec87bdeced

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
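
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper name `encode_to_bits` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table above.

```python
# Assumed mapping from the charset labels used on this page to Python codec
# names. The page itself may use a different library with different names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "日"  # hypothetical input; any short string works
    for label, codec in CHARSETS.items():
        bits, hexa = encode_to_bits(sample, codec)
        print(label, sample, bits, hexa)
```

For the single character 日, this yields `e697a5` in UTF-8, `c6fc` in EUC-JP, and `93fa` in CP932, which agree with the final bytes of the corresponding rows in the table.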