To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???踰??乙λ????蚓??儒??億??? 00111111001111110011111111100110111110100011111100111111100010011011001110000011110010010011111100111111001111110011111111100101011011010011111100111111100011101111001000111111001111111000100110101101001111110011111100111111 3f3f3fe6fa3f3f89b383c93f3f3f3fe56d3f3f8ef23f3f89ad3f3f3f
EUC-JP ???踰??乙λ????蚓??儒??億??沅 001111110011111100111111111011001111110000111111001111111011001010110101101001101100101100111111001111110011111100111111111010011100111000111111001111111011110011110100001111110011111110110010101011110011111100111111100011111100011011101001 3f3f3fecfc3f3fb2b5a6cb3f3f3f3fe9ce3f3fbcf43f3fb2af3f3f8fc6e9
UTF-8 閱묐갭踰딀룚乙λ겱廬믩챷蚓껆뫀儒밸윪億됰떥沅 1110100110010110101100011110101110101100100100001110101010110000101011011110100010111000101100001110101110010100100000001110101110100011100110101110010010111001100110011100111010111011111010101011001010110001111011111010011010000010111010111010111110101001111011001011000110110111111010001001101010010011111010101011101110000110111010111010101110000000111001011000010010010010111010111011000010111000111011001001110010101010111001011000010010000100111010111001000010110000111010111001011010100101111001101011001010000101 e996b1ebac90eab0ade8b8b0eb9480eba39ae4b999cebbeab2b1efa682ebafa9ecb1b7e89a93eabb86ebab80e58492ebb0b8ec9caae58484eb90b0eb96a5e6b285
UHC 閱묐갭踰딀룚乙λ겱廬믩챷蚓껆뫀儒밸윪億됰떥沅 1110011011110011100100011110101110110000101110001110101110110010100010101110011010001111100101101110101111100000101001011110101110000001101111011110010111111110100100101110101110101010100001001110110011100010100000111110011110010001101001001110101011100011101110011110101110011111101010011110010111100010100010011110101110001011101110001110101010110110 e6f391ebb0b8ebb28ae68f96ebe0a5eb81bde5fe92ebaa84ece283e791a4eae3b9eb9fa9e5e289eb8bb8eab6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)