To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 藏먲풐將됵슬鸚켫 111010001001011110001111111010111010100010110010111011011001001010010000111001011011000010000111111010111001000010110101111011001000101010101100111010011011100010011010111011001011110010101011 e8978feba8b2ed9290e5b087eb90b5ec8aace9b89aecbcab
SJIS-WIN ????¨?????°??????¬?????? 001111110011111100111111001111111000000101001110001111110011111100111111001111110011111110000001100010110011111100111111001111110011111100111111001111111000000111001010001111110011111100111111001111110011111100111111 3f3f3f3f814e3f3f3f3f3f818b3f3f3f3f3f3f81ca3f3f3f3f3f3f
EUC-JP è??ë¨?í??å°?ë??ì?¬é¸?ì?? 100011111010101110110010001111110011111110001111101010111011001110100001101011110011111110001111101010111011111100111111001111111000111110101011101010011010000111101011001111111000111110101011101100110011111100111111100011111010101111000000001111111010001011001100100011111010101110110001100011111010001010110001001111111000111110101011110000000011111100111111 8fabb23f3f8fabb3a1af3f8fabbf3f3f8faba9a1eb3f8fabb33f3f8fabc03fa2cc8fabb18fa2b13f8fabc03f3f
UTF-8 藏먲풐將됵슬鸚켫 110000111010100011000010100101111100001010001111110000111010101111000010101010001100001010110010110000111010110111000010100100101100001010010000110000111010010111000010101100001100001010000111110000111010101111000010100100001100001010110101110000111010110011000010100010101100001010101100110000111010100111000010101110001100001010011010110000111010110011000010101111001100001010101011 c3a8c297c28fc3abc2a8c2b2c3adc292c290c3a5c2b0c287c3abc290c2b5c3acc28ac2acc3a9c2b8c29ac3acc2bcc2ab
UHC ????¨²????°????????¸??¼? 0011111100111111001111110011111110100001101001111010100111110111001111110011111100111111001111111010000111000110001111110011111100111111001111110011111100111111001111110011111110100010101011000011111100111111101010001111100100111111 3f3f3f3fa1a7a9f73f3f3f3fa1c63f3f3f3f3f3f3f3fa2ac3f3fa8f93f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)