To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 竪多蔵辰存損奪俗息竪多蔵辰存損奪俗息B 10010010010001111001000110111101100100011010000010010010010000111001000110110110100100011011100110010010010001001001000110101101100100011010011110010010010001111001000110111101100100011010000010010010010000111001000110110110100100011011100110010010010001001001000110101101100100011010011101000010 924791bd91a0924391b691b9924491ad91a7924791bd91a0924391b691b9924491ad91a742
EUC-JP 竪多蔵辰存損奪俗息竪多蔵辰存損奪俗息B 11000011101010001100001010111111110000101010001011000011101001001100001010111000110000101011101111000011101001011100001010101111110000101010100111000011101010001100001010111111110000101010001011000011101001001100001010111000110000101011101111000011101001011100001010101111110000101010100101000010 c3a8c2bfc2a2c3a4c2b8c2bbc3a5c2afc2a9c3a8c2bfc2a2c3a4c2b8c2bbc3a5c2afc2a942
UTF-8 竪多蔵辰存損奪俗息竪多蔵辰存損奪俗息B 11100111101010111010101011100101101001001001101011101000100101001011010111101000101111101011000011100101101011011001100011100110100100001000110111100101101001011010101011100100101111111001011111100110100000011010111111100111101010111010101011100101101001001001101011101000100101001011010111101000101111101011000011100101101011011001100011100110100100001000110111100101101001011010101011100100101111111001011111100110100000011010111101000010 e7abaae5a49ae894b5e8beb0e5ad98e6908de5a5aae4bf97e681afe7abaae5a49ae894b5e8beb0e5ad98e6908de5a5aae4bf97e681af42
UHC 竪多?辰存損奪俗息竪多?辰存損奪俗息B 1110001010110101110100101111110100111111111100101110001111110000111011011110000111011111111101111010110011100001110101001110001111010011111000101011010111010010111111010011111111110010111000111111000011101101111000011101111111110111101011001110000111010100111000111101001101000010 e2b5d2fd3ff2e3f0ede1dff7ace1d4e3d3e2b5d2fd3ff2e3f0ede1dff7ace1d4e3d342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)