To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 鱈損旦遜狸村旦遜[鱈損旦遜狸村旦遜[^ 1001001001001100100100011011100110010010010101011001000110111011100100100100101110010001101110101001001001010101100100011011101101011011100100100100110010010001101110011001001001010101100100011011101110010010010010111001000110111010100100100101010110010001101110110101101101011110 924c91b9925591bb924b91ba925591bb5b924c91b9925591bb924b91ba925591bb5b5e
EUC-JP 鱈損旦遜狸村旦遜[鱈損旦遜狸村旦遜[^ 1100001110101101110000101011101111000011101101101100001010111101110000111010110011000010101111001100001110110110110000101011110101011011110000111010110111000010101110111100001110110110110000101011110111000011101011001100001010111100110000111011011011000010101111010101101101011110 c3adc2bbc3b6c2bdc3acc2bcc3b6c2bd5bc3adc2bbc3b6c2bdc3acc2bcc3b6c2bd5b5e
UTF-8 鱈損旦遜狸村旦遜[鱈損旦遜狸村旦遜[^ 111010011011000110001000111001101001000010001101111001101001011110100110111010011000000110011100111001111000101110111000111001101001110110010001111001101001011110100110111010011000000110011100010110111110100110110001100010001110011010010000100011011110011010010111101001101110100110000001100111001110011110001011101110001110011010011101100100011110011010010111101001101110100110000001100111000101101101011110 e9b188e6908de697a6e9819ce78bb8e69d91e697a6e9819c5be9b188e6908de697a6e9819ce78bb8e69d91e697a6e9819c5b5e
UHC ?損旦遜狸村旦遜[?損旦遜狸村旦遜[^ 001111111110000111011111110100111010100111100001111000011101011111100001111101011011110111010011101010011110000111100001010110110011111111100001110111111101001110101001111000011110000111010111111000011111010110111101110100111010100111100001111000010101101101011110 3fe1dfd3a9e1e1d7e1f5bdd3a9e1e15b3fe1dfd3a9e1e1d7e1f5bdd3a9e1e15b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)