To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 韶タ鈆ヒ韶タ鈆ヒn}韶タ鈆ヒ韶タ鈆ヒn{^ 1110100011101110110000001111101111000001110010111110100011101110110000001111101111000001110010110110111001111101111010001110111011000000111110111100000111001011111010001110111011000000111110111100000111001011011011100111101101011110 e8eec0fbc1cbe8eec0fbc1cb6e7de8eec0fbc1cbe8eec0fbc1cb6e7b5e
EUC-JP 韶タ鈆ヒ韶タ鈆ヒn}韶タ鈆ヒ韶タ鈆ヒn{^ 1111000011110000100011101100000010001111111000111011110010001110110010111111000011110000100011101100000010001111111000111011110010001110110010110110111001111101111100001111000010001110110000001000111111100011101111001000111011001011111100001111000010001110110000001000111111100011101111001000111011001011011011100111101101011110 f0f08ec08fe3bc8ecbf0f08ec08fe3bc8ecb6e7df0f08ec08fe3bc8ecbf0f08ec08fe3bc8ecb6e7b5e
UTF-8 韶タ鈆ヒ韶タ鈆ヒn}韶タ鈆ヒ韶タ鈆ヒn{^ 1110100110011111101101101110111110111110100000001110100110001000100001101110111110111110100010111110100110011111101101101110111110111110100000001110100110001000100001101110111110111110100010110110111001111101111010011001111110110110111011111011111010000000111010011000100010000110111011111011111010001011111010011001111110110110111011111011111010000000111010011000100010000110111011111011111010001011011011100111101101011110 e99fb6efbe80e98886efbe8be99fb6efbe80e98886efbe8b6e7de99fb6efbe80e98886efbe8be99fb6efbe80e98886efbe8b6e7b5e
UHC 韶???韶???n}韶???韶???n{^ 11100001110100100011111100111111001111111110000111010010001111110011111100111111011011100111110111100001110100100011111100111111001111111110000111010010001111110011111100111111011011100111101101011110 e1d23f3f3fe1d23f3f3f6e7de1d23f3f3fe1d23f3f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)