To which bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????k}????????k{^ 001111110011111100111111001111110011111100111111001111110011111101101011011111010011111100111111001111110011111100111111001111110011111100111111011010110111101101011110 3f3f3f3f3f3f3f3f6b7d3f3f3f3f3f3f3f3f6b7b5e
SJIS-WIN 鋺茨スカ鞨ゑスェk}鋺茨スカ鞨ゑスェk{^ 1110011111111010100010001110111110111101101101101110100011100000100000101110111110111101101010100110101101111101111001111111101010001000111011111011110110110110111010001110000010000010111011111011110110101010011010110111101101011110 e7fa88efbdb6e8e082efbdaa6b7de7fa88efbdb6e8e082efbdaa6b7b5e
EUC-JP 鋺茨スカ鞨ゑスェk}鋺茨スカ鞨ゑスェk{^ 11101110111111001011000011110001100011101011110110001110101101101111000011100010101001001111000110001110101111011000111010101010011010110111110111101110111111001011000011110001100011101011110110001110101101101111000011100010101001001111000110001110101111011000111010101010011010110111101101011110 eefcb0f18ebd8eb6f0e2a4f18ebd8eaa6b7deefcb0f18ebd8eb6f0e2a4f18ebd8eaa6b7b5e
UTF-8 鋺茨スカ鞨ゑスェk}鋺茨スカ鞨ゑスェk{^ 1110100110001011101110101110100010001100101010001110111110111101101111011110111110111101101101101110100110011110101010001110001110000010100100011110111110111101101111011110111110111101101010100110101101111101111010011000101110111010111010001000110010101000111011111011110110111101111011111011110110110110111010011001111010101000111000111000001010010001111011111011110110111101111011111011110110101010011010110111101101011110 e98bbae88ca8efbdbdefbdb6e99ea8e38291efbdbdefbdaa6b7de98bbae88ca8efbdbdefbdb6e99ea8e38291efbdbdefbdaa6b7b5e
UHC ?茨??鞨ゑ??k}?茨??鞨ゑ??k{^ 001111111110110110111100001111110011111111001010111010101010101011110001001111110011111101101011011111010011111111101101101111000011111100111111110010101110101010101010111100010011111100111111011010110111101101011110 3fedbc3f3fcaeaaaf13f3f6b7d3fedbc3f3fcaeaaaf13f3f6b7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
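
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes visible in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("k}").items():
        print(label, bits, hexstr)
```

For an ASCII-only input such as `k}`, every charset listed here yields the same bytes (`6b7d`); the rows diverge only for characters outside ASCII.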