To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 存驀塗?肆???⇒[存驀塗?肆???⇒[^ 10010001101101101110100101111101100100110110100000111111111000111110011000111111001111110011111110000001110010110101101110010001101101101110100101111101100100110110100000111111111000111110011000111111001111110011111110000001110010110101101101011110 91b6e97d93683fe3e63f3f3f81cb5b91b6e97d93683fe3e63f3f3f81cb5b5e
EUC-JP 存驀塗翟肆???⇒[存驀塗翟肆???⇒[^ 1100001010111000111100011101111011000101110010011000111111010101101111001110011011101000001111110011111100111111101000101100110101011011110000101011100011110001110111101100010111001001100011111101010110111100111001101110100000111111001111110011111110100010110011010101101101011110 c2b8f1dec5c98fd5bce6e83f3f3fa2cd5bc2b8f1dec5c98fd5bce6e83f3f3fa2cd5b5e
UTF-8 存驀塗翟肆범臨펫⇒[存驀塗翟肆범臨펫⇒[^ 111001011010110110011000111010011010100110000000111001011010000110010111111001111011111110011111111010001000001010000110111010111011001010010100111011111010011110110110111011011000111010101011111000101000011110010010010110111110010110101101100110001110100110101001100000001110010110100001100101111110011110111111100111111110100010000010100001101110101110110010100101001110111110100111101101101110110110001110101010111110001010000111100100100101101101011110 e5ad98e9a980e5a197e7bf9fe88286ebb294efa7b6ed8eabe287925be5ad98e9a980e5a197e7bf9fe88286ebb294efa7b6ed8eabe287925b5e
UHC 存驀塗翟肆범臨펫⇒[存驀塗翟肆범臨펫⇒[^ 111100001110110111011000111010011101001111110011111011101110000111011110111010111011100111111100111011001111101011000110111010101010001010100001010110111111000011101101110110001110100111010011111100111110111011100001110111101110101110111001111111001110110011111010110001101110101010100010101000010101101101011110 f0edd8e9d3f3eee1deebb9fcecfac6eaa2a15bf0edd8e9d3f3eee1deebb9fcecfac6eaa2a15b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)