To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 阨茨スー陜呻スーn}阨茨スー陜呻スーn{^ 1110100010010101100010001110111110111101101100001110100010011101100110011110111110111101101100000110111001111101111010001001010110001000111011111011110110110000111010001001110110011001111011111011110110110000011011100111101101011110 e89588efbdb0e89d99efbdb06e7de89588efbdb0e89d99efbdb06e7b5e
EUC-JP 阨茨スー陜呻スーn}阨茨スー陜呻スーn{^ 11101111111101011011000011110001100011101011110110001110101100001110111111111101110100101111000110001110101111011000111010110000011011100111110111101111111101011011000011110001100011101011110110001110101100001110111111111101110100101111000110001110101111011000111010110000011011100111101101011110 eff5b0f18ebd8eb0effdd2f18ebd8eb06e7deff5b0f18ebd8eb0effdd2f18ebd8eb06e7b5e
UTF-8 阨茨スー陜呻スーn}阨茨スー陜呻スーn{^ 1110100110011000101010001110100010001100101010001110111110111101101111011110111110111101101100001110100110011001100111001110010110010001101110111110111110111101101111011110111110111101101100000110111001111101111010011001100010101000111010001000110010101000111011111011110110111101111011111011110110110000111010011001100110011100111001011001000110111011111011111011110110111101111011111011110110110000011011100111101101011110 e998a8e88ca8efbdbdefbdb0e9999ce591bbefbdbdefbdb06e7de998a8e88ca8efbdbdefbdb0e9999ce591bbefbdbdefbdb06e7b5e
UHC ?茨??陜呻??n}?茨??陜呻??n{^ 001111111110110110111100001111110011111111111001111100001110001111100010001111110011111101101110011111010011111111101101101111000011111100111111111110011111000011100011111000100011111100111111011011100111101101011110 3fedbc3f3ff9f0e3e23f3f6e7d3fedbc3f3ff9f0e3e23f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)