Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????~???????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101111110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f7e3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???誼→????獄??~???誼→????獄?? 00111111001111110011111110001011011000101000000110101000001111110011111100111111001111111000110110010110001111110011111101111110001111110011111100111111100010110110001010000001101010000011111100111111001111110011111110001101100101100011111100111111 3f3f3f8b6281a83f3f3f3f8d963f3f7e3f3f3f8b6281a83f3f3f3f8d963f3f
EUC-JP 獒??誼→????獄??~獒??誼→????獄?? 1000111111001011101110110011111100111111101101011100001110100010101010100011111100111111001111110011111110111001111101100011111100111111011111101000111111001011101110110011111100111111101101011100001110100010101010100011111100111111001111110011111110111001111101100011111100111111 8fcbbb3f3fb5c3a2aa3f3f3f3fb9f63f3f7e8fcbbb3f3fb5c3a2aa3f3f3f3fb9f63f3f
UTF-8 獒붿룆誼→샍戮고뭵獄쏄큷~獒붿룆誼→샍戮고뭵獄쏄큷 11100111100011011001001011101011101101101011111111101011101000111000011011101000101010101011110011100010100001101001001011101100100000111000110111101111101001111001001011101010101100111010000011101011101011011011010111100111100011011000010011101100100011111000010011101101100000011011011101111110111001111000110110010010111010111011011010111111111010111010001110000110111010001010101010111100111000101000011010010010111011001000001110001101111011111010011110010010111010101011001110100000111010111010110110110101111001111000110110000100111011001000111110000100111011011000000110110111 e78d92ebb6bfeba386e8aabce28692ec838defa792eab3a0ebadb5e78d84ec8f84ed81b77ee78d92ebb6bfeba386e8aabce28692ec838defa792eab3a0ebadb5e78d84ec8f84ed81b7
UHC 獒붿룆誼→샍戮고뭵獄쏄큷~獒붿룆誼→샍戮고뭵獄쏄큷 11101000101000111001010011101100100011111000010111101011111111101010000111100110100110001011101111101011101111011011000011101101100100101000010011101000101010111001101111101010101101001000011001111110111010001010001110010100111011001000111110000101111010111111111010100001111001101001100010111011111010111011110110110000111011011001001010000100111010001010101110011011111010101011010010000110 e8a394ec8f85ebfea1e698bbebbdb0ed9284e8ab9beab4867ee8a394ec8f85ebfea1e698bbebbdb0ed9284e8ab9beab486

SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
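
The kind of conversion shown in the table can be approximated with a short Python sketch. It is not this tool's implementation. The function name encode_table and the mapping of the table's labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumptions made for illustration. Unencodable characters are replaced with "?", which matches the 3f bytes seen in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("誼→").items():
        print(label, bits, hexstr)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.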