To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 旭??頭??奚?}v旭??頭??奚?}vB 100010001010111000111111001111111001001110101010001111110011111110011010111101100011111101111101011101101000100010101110001111110011111110010011101010100011111100111111100110101111011000111111011111010111011001000010 88ae3f3f93aa3f3f9af63f7d7688ae3f3f93aa3f3f9af63f7d7642
EUC-JP 旭?祜頭??奚?}v旭?祜頭??奚?}vB 10110000101100000011111110001111110100001101100011000110101011000011111100111111110101001111100000111111011111010111011010110000101100000011111110001111110100001101100011000110101011000011111100111111110101001111100000111111011111010111011001000010 b0b03f8fd0d8c6ac3f3fd4f83f7d76b0b03f8fd0d8c6ac3f3fd4f83f7d7642
UTF-8 旭렔祜頭렖렕奚렪}v旭렔祜頭렖렕奚렪}vB 1110011010010111101011011110101110100000100101001110011110100101100111001110100110100000101011011110101110100000100101101110101110100000100101011110010110100101100110101110101110100000101010100111110101110110111001101001011110101101111010111010000010010100111001111010010110011100111010011010000010101101111010111010000010010110111010111010000010010101111001011010010110011010111010111010000010101010011111010111011001000010 e697adeba094e7a59ce9a0adeba096eba095e5a59aeba0aa7d76e697adeba094e7a59ce9a0adeba096eba095e5a59aeba0aa7d7642
UHC 旭렔祜頭렖렕奚렪}v旭렔祜頭렖렕奚렪}vB 11101001111011111000111010101001111110111101010011010100111010011000111010101011100011101010101011111010101010001000111010111000011111010111011011101001111011111000111010101001111110111101010011010100111010011000111010101011100011101010101011111010101010001000111010111000011111010111011001000010 e9ef8ea9fbd4d4e98eab8eaafaa88eb87d76e9ef8ea9fbd4d4e98eab8eaafaa88eb87d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)