To which bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????~???????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101111110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f7e3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???誼→????獄??~???誼→????獄?? 00111111001111110011111110001011011000101000000110101000001111110011111100111111001111111000110110010110001111110011111101111110001111110011111100111111100010110110001010000001101010000011111100111111001111110011111110001101100101100011111100111111 3f3f3f8b6281a83f3f3f3f8d963f3f7e3f3f3f8b6281a83f3f3f3f8d963f3f
EUC-JP ???誼→????獄??~???誼→????獄?? 00111111001111110011111110110101110000111010001010101010001111110011111100111111001111111011100111110110001111110011111101111110001111110011111100111111101101011100001110100010101010100011111100111111001111110011111110111001111101100011111100111111 3f3f3fb5c3a2aa3f3f3f3fb9f63f3f7e3f3f3fb5c3a2aa3f3f3f3fb9f63f3f
UTF-8 凉깅돃誼→샍戮고뭵獄쏄큷~凉깅돃誼→샍戮고뭵獄쏄큷 11101111101001011011100111101010101110011000010111101011100011111000001111101000101010101011110011100010100001101001001011101100100000111000110111101111101001111001001011101010101100111010000011101011101011011011010111100111100011011000010011101100100011111000010011101101100000011011011101111110111011111010010110111001111010101011100110000101111010111000111110000011111010001010101010111100111000101000011010010010111011001000001110001101111011111010011110010010111010101011001110100000111010111010110110110101111001111000110110000100111011001000111110000100111011011000000110110111 efa5b9eab985eb8f83e8aabce28692ec838defa792eab3a0ebadb5e78d84ec8f84ed81b77eefa5b9eab985eb8f83e8aabce28692ec838defa792eab3a0ebadb5e78d84ec8f84ed81b7
UHC 凉깅돃誼→샍戮고뭵獄쏄큷~凉깅돃誼→샍戮고뭵獄쏄큷 11100101101111001011000111101011100010011001011011101011111111101010000111100110100110001011101111101011101111011011000011101101100100101000010011101000101010111001101111101010101101001000011001111110111001011011110010110001111010111000100110010110111010111111111010100001111001101001100010111011111010111011110110110000111011011001001010000100111010001010101110011011111010101011010010000110 e5bcb1eb8996ebfea1e698bbebbdb0ed9284e8ab9beab4867ee5bcb1eb8996ebfea1e698bbebbdb0ed9284e8ab9beab486

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
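
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's codecs "cp932" and "cp949" correspond to SJIS-Win and UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f runs in the table. The names CHARSETS, encode_row and encode_table are hypothetical helpers introduced only for this illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's equivalent of SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is Python's equivalent of UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per encoded byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("誼→獄~").items():
        print(label, bits, hex_string)
```

Running the script prints one line per charset for the sample string; pass any other short string to encode_table to compare its encodings.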