To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鞨オ蝟懶ス「繧手ョ馴し蔬懶ス「籵剰揃B 11101000111000001011010111100101100101101001110011101111101111011010001011100011100000101000111011101000101011101001001111101001100000101011010111100100111101001001110011101111101111011010001011100010111000001000111111101000100100011011010101000010 e8e0b5e5969cefbda2e3828ee8ae93e982b5e4f49cefbda2e2e08fe891b542
EUC-JP 鞨オ蝟懶ス「繧手ョ馴し蔬懶ス「籵剰揃B 11110000111000101000111010110101111010011111011011011000111100011000111010111101100011101010001011100101111000101011110011101010100011101010111011000110111010111010010010110111111010001111011011011000111100011000111010111101100011101010001011100100111000101011111011101010110000101011011101000010 f0e28eb5e9f6d8f18ebd8ea2e5e2bcea8eaec6eba4b7e8f6d8f18ebd8ea2e4e2beeac2b742
UTF-8 鞨オ蝟懶ス「繧手ョ馴し蔬懶ス「籵剰揃B 11101001100111101010100011101111101111011011010111101000100111011001111111100110100001111011011011101111101111011011110111101111101111011010001011100111101110011010011111100110100010011000101111101111101111011010111011101001101001101011010011100011100000011001011111101000100101001010110011100110100001111011011011101111101111011011110111101111101111011010001011100111101100011011010111100101100010011011000011100110100011111000001101000010 e99ea8efbdb5e89d9fe687b6efbdbdefbda2e7b9a7e6898befbdaee9a6b4e38197e894ace687b6efbdbdefbda2e7b1b5e589b0e68f8342
UHC 鞨?蝟懶???手?馴し蔬懶?????B 110010101110101000111111111010101101101011010100111110110011111100111111001111111110001010100010001111111110001011111000101010101011011111100001110010101101010011111011001111110011111100111111001111110011111101000010 caea3feadad4fb3f3f3fe2a23fe2f8aab7e1cad4fb3f3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)