To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??巡??筌??竊????????應? 00111111001111110011111111100010100001100011111100111111100011111000010000111111001111111110001010100011001111110011111111100010100001100011111100111111001111110011111100111111001111110011111100111111100111001110010000111111 3f3f3fe2863f3f8f843f3fe2a33f3fe2863f3f3f3f3f3f3f3f9ce43f
EUC-JP ???竊??巡??筌??竊??洹?????應? 001111110011111100111111111000111110011000111111001111111011110111100100001111110011111111100100101001010011111100111111111000111110011000111111001111111000111111000111101110100011111100111111001111110011111100111111110110001110011000111111 3f3f3fe3e63f3fbde43f3fe4a53f3fe3e63f3f8fc7ba3f3f3f3f3fd8e63f
UTF-8 呂얜벉竊뺟쑴巡볩폊筌뉗뮆竊뽳㎗洹숆깻列룸씈應쿍 111011111010011010000000111011001001011010011100111010111011001010001001111001111010101110001010111010111011101010011111111011001001000110110100111001011011011110100001111010111011001110101001111011011000111110001010111001111010110110001100111010111000100110010111111010111010111010000110111001111010101110001010111010111011110110110011111000111000111010010111111001101011010010111001111011001000100010000110111010101011100110111011111011111010011010011100111010111010001110111000111011001001010010001000111001101000011110001001111011001011111110001101 efa680ec969cebb289e7ab8aebba9fec91b4e5b7a1ebb3a9ed8f8ae7ad8ceb8997ebae86e7ab8aebbdb3e38e97e6b4b9ec8886eab9bbefa69ceba3b8ec9488e68789ecbf8d
UHC 呂얜벉竊뺟쑴巡볩폊筌뉗뮆竊뽳㎗洹숆깻列룸씈應쿍 11100101111110111011111011101011100100111010110011101111101111001001010111100111101111101010100111100010110111101001001111101111101111001001010111101111101001111000011111101100100100101001010111101111101111001001011011101111101001111010001111101010101101111001100111101010101100101010001011100110111010101011011111101011100111011010000011101011111010111011001101000010 e5fbbeeb93acefbc95e7bea9e2de93efbc95efa787ec9295efbc96efa7a3eab799eab2a2e6eab7eb9da0ebebb342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)