What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????r[?????????r[^ 0011111100111111001111110011111100111111001111110011111100111111001111110111001001011011001111110011111100111111001111110011111100111111001111110011111100111111011100100101101101011110 3f3f3f3f3f3f3f3f3f725b3f3f3f3f3f3f3f3f3f725b5e
SJIS-WIN 薔e梱鐔わ醜膵э什r[薔e梱鐔わ醜膵э什r[^ 1110010101001011100000101000010110001101101010111110100001011100100000101110110110001111010110001110010001011000100001001000111110001111010110010111001001011011111001010100101110000010100001011000110110101011111010000101110010000010111011011000111101011000111001000101100010000100100011111000111101011001011100100101101101011110 e54b82858dabe85c82ed8f58e458848f8f59725be54b82858dabe85c82ed8f58e458848f8f59725b5e
EUC-JP 薔e梱鐔わ醜膵э什r[薔e梱鐔わ醜膵э什r[^ 1110100110101100101000111110010110111010101011011110111110111101101001001110111110111101101110011110011110111001101001111110111110111101101110100111001001011011111010011010110010100011111001011011101010101101111011111011110110100100111011111011110110111001111001111011100110100111111011111011110110111010011100100101101101011110 e9aca3e5baadefbda4efbdb9e7b9a7efbdba725be9aca3e5baadefbda4efbdb9e7b9a7efbdba725b5e
UTF-8 薔e梱鐔わ醜膵э什r[薔e梱鐔わ醜膵э什r[^ 111010001001011010010100111011111011110110000101111001101010001010110001111010011001000010010100111000111000001010001111111010011000011010011100111010001000011010110101110100011000110111100100101110111000000001110010010110111110100010010110100101001110111110111101100001011110011010100010101100011110100110010000100101001110001110000010100011111110100110000110100111001110100010000110101101011101000110001101111001001011101110000000011100100101101101011110 e89694efbd85e6a2b1e99094e3828fe9869ce886b5d18de4bb80725be89694efbd85e6a2b1e99094e3828fe9869ce886b5d18de4bb80725b5e
UHC 薔e梱?わ醜膵э什r[薔e梱?わ醜膵э什r[^ 111011011111100110100011111001011100110111100001001111111010101011101111111101011101110111110101111111011010110011101111111001001010011101110010010110111110110111111001101000111110010111001101111000010011111110101010111011111111010111011101111101011111110110101100111011111110010010100111011100100101101101011110 edf9a3e5cde13faaeff5ddf5fdacefe4a7725bedf9a3e5cde13faaeff5ddf5fdacefe4a7725b5e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
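
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the behaviour visible in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-win corresponds to code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Unencodable characters become "?" (byte 0x3f) instead of raising.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "r[^" is plain ASCII, so every charset yields the same bytes: 72 5b 5e.
    for name, (bits, hexstr) in encode_table("r[^").items():
        print(name, bits, hexstr)
```

For ASCII-only input every row is identical, as in the trailing 725b5e of each row above. The rows differ only for characters outside ASCII, where each charset assigns its own byte sequence.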