Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 偲治偲治偲鴫偲式}v偲治偲治偲鴫偲式}vB 10001110110000111000111010100001100011101100001110001110101000011000111011000011100011101011000010001110110000111000111010101110011111010111011010001110110000111000111010100001100011101100001110001110101000011000111011000011100011101011000010001110110000111000111010101110011111010111011001000010 8ec38ea18ec38ea18ec38eb08ec38eae7d768ec38ea18ec38ea18ec38eb08ec38eae7d7642
EUC-JP 偲治偲治偲鴫偲式}v偲治偲治偲鴫偲式}vB 10111100110001011011110010100011101111001100010110111100101000111011110011000101101111001011001010111100110001011011110010110000011111010111011010111100110001011011110010100011101111001100010110111100101000111011110011000101101111001011001010111100110001011011110010110000011111010111011001000010 bcc5bca3bcc5bca3bcc5bcb2bcc5bcb07d76bcc5bca3bcc5bca3bcc5bcb2bcc5bcb07d7642
UTF-8 偲治偲治偲鴫偲式}v偲治偲治偲鴫偲式}vB 1110010110000001101100101110011010110010101110111110010110000001101100101110011010110010101110111110010110000001101100101110100110110100101010111110010110000001101100101110010110111100100011110111110101110110111001011000000110110010111001101011001010111011111001011000000110110010111001101011001010111011111001011000000110110010111010011011010010101011111001011000000110110010111001011011110010001111011111010111011001000010 e581b2e6b2bbe581b2e6b2bbe581b2e9b4abe581b2e5bc8f7d76e581b2e6b2bbe581b2e6b2bbe581b2e9b4abe581b2e5bc8f7d7642
UHC ?治?治???式}v?治?治???式}vB 001111111111011010111101001111111111011010111101001111110011111100111111111000111101001001111101011101100011111111110110101111010011111111110110101111010011111100111111001111111110001111010010011111010111011001000010 3ff6bd3ff6bd3f3f3fe3d27d763ff6bd3ff6bd3f3f3fe3d27d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
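
As a rough sketch of how a table like the one above could be reproduced offline, the Python snippet below encodes a string in each charset and builds the binary and hexadecimal bit strings. The function name `encode_table` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, not the converter's actual implementation. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the table shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("治"):
        print(label, bits, hexstr)
```

Because each byte is rendered as exactly eight binary digits, the binary column is always four times as long as the hexadecimal column.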