Which bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 逵ク贒サ逵ク鄙ヲn}逵ク贒サ逵ク鄙ヲn{^ 1110011110011100101110001111101110101111101110111110011110011100101110001110011110111111101001100110111001111101111001111001110010111000111110111010111110111011111001111001110010111000111001111011111110100110011011100111101101011110 e79cb8fbafbbe79cb8e7bfa66e7de79cb8fbafbbe79cb8e7bfa66e7b5e
EUC-JP 逵ク贒サ逵ク鄙ヲn}逵ク贒サ逵ク鄙ヲn{^ 111011011111110010001110101110001000111111011111110000111000111010111011111011011111110010001110101110001110111011000001100011101010011001101110011111011110110111111100100011101011100010001111110111111100001110001110101110111110110111111100100011101011100011101110110000011000111010100110011011100111101101011110 edfc8eb88fdfc38ebbedfc8eb8eec18ea66e7dedfc8eb88fdfc38ebbedfc8eb8eec18ea66e7b5e
UTF-8 逵ク贒サ逵ク鄙ヲn}逵ク贒サ逵ク鄙ヲn{^ 1110100110000000101101011110111110111101101110001110100010110100100100101110111110111101101110111110100110000000101101011110111110111101101110001110100110000100100110011110111110111101101001100110111001111101111010011000000010110101111011111011110110111000111010001011010010010010111011111011110110111011111010011000000010110101111011111011110110111000111010011000010010011001111011111011110110100110011011100111101101011110 e980b5efbdb8e8b492efbdbbe980b5efbdb8e98499efbda66e7de980b5efbdb8e8b492efbdbbe980b5efbdb8e98499efbda66e7b5e
UHC 逵???逵?鄙?n}逵???逵?鄙?n{^ 110100001011000000111111001111110011111111010000101100000011111111011110101010010011111101101110011111011101000010110000001111110011111100111111110100001011000000111111110111101010100100111111011011100111101101011110 d0b03f3f3fd0b03fdea93f6e7dd0b03f3f3fd0b03fdea93f6e7b5e

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP).
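
The rows of the table above can be approximated offline with a short Python sketch. This is not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents of the charsets listed, and the helper names `encode_row` and `encode_table` are made up for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 and UHC rows show. Individual bytes may still differ from the table for vendor-specific characters.

```python
# Table labels mapped to Python codec names (assumed equivalents, not confirmed by the page).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode `text` in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("n{^").items():
        print(label, bits, hexstr)
```

For ASCII-only input such as "n{^", every charset listed here yields the same three bytes, which is why each row of the table ends in 6e7b5e.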