What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畏????????節e<鸚??言????? 100010001101100000111111001111110011111100111111001111110011111100111111001111111001000011011111100000101000010110000001100000111110101001011111001111110011111110001100101111100011111100111111001111110011111100111111 88d83f3f3f3f3f3f3f3f90df82858183ea5f3f3f8cbe3f3f3f3f3f
EUC-JP 畏????????節e<鸚??言????? 101100001101101000111111001111110011111100111111001111110011111100111111001111111100000011100001101000111110010110100001111000111111001111000000001111110011111110111000110000000011111100111111001111110011111100111111 b0da3f3f3f3f3f3f3f3fc0e1a3e5a1e3f3c03f3fb8c03f3f3f3f3f
UTF-8 畏뷂쉽樂쒙슁筽꾤뵽節e<鸚뗩뼻言븅츗樂쒙슁 111001111001010110001111111010111011011110000010111011001000100110111101111011111010011010111111111011001001001010011001111011001000101010000001111001111010110110111101111010101011111010100100111010111011010110111101111001111010111110000000111011111011110110000101111011111011110010011100111010011011100010011010111010111001011110101001111010111011110010111011111010001010100010000000111010111011100010000101111011001011100010010111111011111010011010111111111011001001001010011001111011001000101010000001 e7958febb782ec89bdefa6bfec9299ec8a81e7adbdeabea4ebb5bde7af80efbd85efbc9ce9b89aeb97a9ebbcbbe8a880ebb885ecb897efa6bfec9299ec8a81
UHC 畏뷂쉽樂쒙슁筽꾤뵽節e<鸚뗩뼻言븅츗樂쒙슁 111010001110011010010100111011111011110110110001111010001111100110011100111011111011110110110011111010001010010010000100111001111001010010111011111011111011110110100011111001011010001110111100111001011010010010001011111010011001011010111110111001011110101110111010111010011010111010010001111010001111100110011100111011111011110110110011 e8e694efbdb1e8f99cefbdb3e8a484e794bbefbda3e5a3bce5a48be996bee5ebbae9ae91e8f99cefbdb3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
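
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping of the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the runs of `3f` in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset of the table, keyed by charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("畏").items():
        print(label, bits, hexstr)
```

For the single character 畏, this sketch yields `e7958f` in UTF-8, `88d8` in SJIS-WIN, `b0da` in EUC-JP, and `3f` in ISO-8859-1, which agrees with the leading bytes of the corresponding rows above.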