What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 荼居輯n}v荼居輯n}vB 11101000100011011011110011100101101100011000010111101000101111001010111101101110011111010111011011101000100011011011110011100101101100011000010111101000101111001010111101101110011111010111011001000010 e88dbce5b185e8bcaf6e7d76e88dbce5b185e8bcaf6e7d7642
SJIS-WIN ????±????n}v????±????n}vB 001111110011111100111111001111111000000101111101001111110011111100111111001111110110111001111101011101100011111100111111001111110011111110000001011111010011111100111111001111110011111101101110011111010111011001000010 3f3f3f3f817d3f3f3f3f6e7d763f3f3f3f817d3f3f3f3f6e7d7642
EUC-JP è??å±?è?¯n}vè??å±?è?¯n}vB 10001111101010111011001000111111001111111000111110101011101010011010000111011110001111111000111110101011101100100011111110001111101000101011010001101110011111010111011010001111101010111011001000111111001111111000111110101011101010011010000111011110001111111000111110101011101100100011111110001111101000101011010001101110011111010111011001000010 8fabb23f3f8faba9a1de3f8fabb23f8fa2b46e7d768fabb23f3f8faba9a1de3f8fabb23f8fa2b46e7d7642
UTF-8 荼居輯n}v荼居輯n}vB 11000011101010001100001010001101110000101011110011000011101001011100001010110001110000101000010111000011101010001100001010111100110000101010111101101110011111010111011011000011101010001100001010001101110000101011110011000011101001011100001010110001110000101000010111000011101010001100001010111100110000101010111101101110011111010111011001000010 c3a8c28dc2bcc3a5c2b1c285c3a8c2bcc2af6e7d76c3a8c28dc2bcc3a5c2b1c285c3a8c2bcc2af6e7d7642
UHC ??¼?±??¼?n}v??¼?±??¼?n}vB 00111111001111111010100011111001001111111010000110111110001111110011111110101000111110010011111101101110011111010111011000111111001111111010100011111001001111111010000110111110001111110011111110101000111110010011111101101110011111010111011001000010 3f3fa8f93fa1be3f3fa8f93f6e7d763f3fa8f93fa1be3f3fa8f93f6e7d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
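
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch, not this page's actual implementation: the function name `to_bit_strings` and the `CHARSETS` mapping are made up for illustration, the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed equivalents of the labels above, and characters that a charset cannot represent are replaced with `?` (hex 3f), so the output may differ from the table for some inputs.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("B", codec)
        print(label, binary, hexadecimal)
```

For the ASCII letter "B", every charset listed here yields the same single byte, 01000010 (hex 42), which matches the last byte of each row in the table.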