To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 寀フ鯊シ岦ワ鮓」n}寀フ鯊シ岦ワ鮓」n{^ 1111101010100111110011001110100111000000101111001111101010101100110111001110100110110110101000110110111001111101111110101010011111001100111010011100000010111100111110101010110011011100111010011011011010100011011011100111101101011110 faa7cce9c0bcfaacdce9b6a36e7dfaa7cce9c0bcfaacdce9b6a36e7b5e
EUC-JP 寀フ鯊シ岦ワ鮓」n}寀フ鯊シ岦ワ鮓」n{^ 1000111110111010110110111000111011001100111100101100001010001110101111001000111110111011101100111000111011011100111100101011100010001110101000110110111001111101100011111011101011011011100011101100110011110010110000101000111010111100100011111011101110110011100011101101110011110010101110001000111010100011011011100111101101011110 8fbadb8eccf2c28ebc8fbbb38edcf2b88ea36e7d8fbadb8eccf2c28ebc8fbbb38edcf2b88ea36e7b5e
UTF-8 寀フ鯊シ岦ワ鮓」n}寀フ鯊シ岦ワ鮓」n{^ 1110010110101111100000001110111110111110100011001110100110101111100010101110111110111101101111001110010110110010101001101110111110111110100111001110100110101110100100111110111110111101101000110110111001111101111001011010111110000000111011111011111010001100111010011010111110001010111011111011110110111100111001011011001010100110111011111011111010011100111010011010111010010011111011111011110110100011011011100111101101011110 e5af80efbe8ce9af8aefbdbce5b2a6efbe9ce9ae93efbda36e7de5af80efbe8ce9af8aefbdbce5b2a6efbe9ce9ae93efbda36e7b5e
UHC 寀???????n}寀???????n{^ 1111001111110010001111110011111100111111001111110011111100111111001111110110111001111101111100111111001000111111001111110011111100111111001111110011111100111111011011100111101101011110 f3f23f3f3f3f3f3f3f6e7df3f23f3f3f3f3f3f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)