To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??怡ワ????筌?‘肄??音??筌c?猿 1110001010100011001111110011111110011100011111011000001110001111001111110011111100111111001111111110001010100011001111111000000101100101111000111110010100111111001111111000100110111001001111110011111111100010101000111000001010000011001111111000100110001110 e2a33f3f9c7d838f3f3f3f3fe2a33f8165e3e53f3f89b93f3fe2a382833f898e
EUC-JP 筌??怡ワ????筌?‘肄??音??筌c?猿 1110010010100101001111110011111111010111110111101010010111101111001111110011111100111111001111111110010010100101001111111010000111000110111001101110011100111111001111111011001010111011001111110011111111100100101001011010001111100011001111111011000111101110 e4a53f3fd7dea5ef3f3f3f3fe4a53fa1c6e6e73f3fb2bb3f3fe4a5a3e33fb1ee
UTF-8 筌듬냱怡ワ쭪類ㅺ퍕筌듬‘肄뚳쭪音깆췅筌c꺃猿 111001111010110110001100111010111001001110101100111010111000001110110001111001101000000010100001111000111000001110101111111011001010110110101010111011111010011110010000111000111000010110111010111011011000110110010101111001111010110110001100111010111001001110101100111000101000000010011000111010001000001010000100111010111001101010110011111011001010110110101010111010011001111110110011111010101011100110000110111011001011011110000101111001111010110110001100111011111011110110000011111010101011101010000011111001111000110010111111 e7ad8ceb93aceb83b1e680a1e383afecadaaefa790e385baed8d95e7ad8ceb93ace28098e88284eb9ab3ecadaae99fb3eab986ecb785e7ad8cefbd83eaba83e78cbf
UHC 筌듬냱怡ワ쭪類ㅺ퍕筌듬‘肄뚳쭪音깆췅筌c꺃猿 1110111110100111101101011110101110000110100000011110110010101110101010111110111110100111100111101110101110111010101001001110101010111011100011001110111110100111101101011110101110100001101011101110110010111101100011001110111110100111100111101110101111100101101100011110110010101101101000001110111110100111101000111110001110000011101011001110101010111011 efa7b5eb8681ecaeabefa79eebbaa4eabb8cefa7b5eba1aeecbd8cefa79eebe5b1ecada0efa7a3e383aceabb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)