Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | 荼居輯B}v荼居輯B}vB | 11101000100011011011110011100101101100011000010111101000101111001010111101000010011111010111011011101000100011011011110011100101101100011000010111101000101111001010111101000010011111010111011001000010 | e88dbce5b185e8bcaf427d76e88dbce5b185e8bcaf427d7642 |
| SJIS-WIN | ????±????B}v????±????B}vB | 001111110011111100111111001111111000000101111101001111110011111100111111001111110100001001111101011101100011111100111111001111110011111110000001011111010011111100111111001111110011111101000010011111010111011001000010 | 3f3f3f3f817d3f3f3f3f427d763f3f3f3f817d3f3f3f3f427d7642 |
| EUC-JP | è??å±?è?¯B}vè??å±?è?¯B}vB | 10001111101010111011001000111111001111111000111110101011101010011010000111011110001111111000111110101011101100100011111110001111101000101011010001000010011111010111011010001111101010111011001000111111001111111000111110101011101010011010000111011110001111111000111110101011101100100011111110001111101000101011010001000010011111010111011001000010 | 8fabb23f3f8faba9a1de3f8fabb23f8fa2b4427d768fabb23f3f8faba9a1de3f8fabb23f8fa2b4427d7642 |
| UTF-8 | 荼居輯B}v荼居輯B}vB | 11000011101010001100001010001101110000101011110011000011101001011100001010110001110000101000010111000011101010001100001010111100110000101010111101000010011111010111011011000011101010001100001010001101110000101011110011000011101001011100001010110001110000101000010111000011101010001100001010111100110000101010111101000010011111010111011001000010 | c3a8c28dc2bcc3a5c2b1c285c3a8c2bcc2af427d76c3a8c28dc2bcc3a5c2b1c285c3a8c2bcc2af427d7642 |
| UHC | ??¼?±??¼?B}v??¼?±??¼?B}vB | 00111111001111111010100011111001001111111010000110111110001111110011111110101000111110010011111101000010011111010111011000111111001111111010100011111001001111111010000110111110001111110011111110101000111110010011111101000010011111010111011001000010 | 3f3fa8f93fa1be3f3fa8f93f427d763f3fa8f93fa1be3f3fa8f93f427d7642 |

SJIS-Win, EUC-JP: Legacy character sets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
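
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the function name `to_bit_strings` and the `CODECS` mapping from the table's charset labels to Python codec names are assumptions made for illustration (CP932 for SJIS-Win follows the note above; `cp949` is assumed as the closest Python codec for UHC). Characters that a charset cannot represent are replaced with `?` (hex `3f`), which resembles the `3f` bytes in the table, though the exact substitutions may differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: closest Python codec for UHC
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Print one row per charset for a short sample string.
for label, codec in CODECS.items():
    binary, hexadecimal = to_bit_strings("B}v", codec)
    print(label, binary, hexadecimal)
```

The ASCII characters `B}v` encode to the same bytes (`427d76`) in every charset listed, which matches the corresponding bytes in each row of the table; differences between charsets only appear for non-ASCII input.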