To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B{N}??????B{N{^ 001111110011111100111111001111110011111100111111010000100111101101001110011111010011111100111111001111110011111100111111001111110100001001111011010011100111101101011110 3f3f3f3f3f3f427b4e7d3f3f3f3f3f3f427b4e7b5e
SJIS-WIN 淡但束誰叩坦B{N}淡但束誰叩坦B{N{^ 100100100101011110010010010000011001000110101001100100100100111010010010010000001001001001010010010000100111101101001110011111011001001001010111100100100100000110010001101010011001001001001110100100100100000010010010010100100100001001111011010011100111101101011110 9257924191a9924e92409252427b4e7d9257924191a9924e92409252427b4e7b5e
EUC-JP 淡但束誰叩坦B{N}淡但束誰叩坦B{N{^ 110000111011100011000011101000101100001010101011110000111010111111000011101000011100001110110011010000100111101101001110011111011100001110111000110000111010001011000010101010111100001110101111110000111010000111000011101100110100001001111011010011100111101101011110 c3b8c3a2c2abc3afc3a1c3b3427b4e7dc3b8c3a2c2abc3afc3a1c3b3427b4e7b5e
UTF-8 淡但束誰叩坦B{N}淡但束誰叩坦B{N{^ 111001101011011110100001111001001011110110000110111001101001110110011111111010001010101010110000111001011000111110101001111001011001110110100110010000100111101101001110011111011110011010110111101000011110010010111101100001101110011010011101100111111110100010101010101100001110010110001111101010011110010110011101101001100100001001111011010011100111101101011110 e6b7a1e4bd86e69d9fe8aab0e58fa9e59da6427b4e7de6b7a1e4bd86e69d9fe8aab0e58fa9e59da6427b4e7b5e
UHC 淡但束誰叩坦B{N}淡但束誰叩坦B{N{^ 110100111011111111010011101000111110000111010110111000101100000111001101101100001111011110100100010000100111101101001110011111011101001110111111110100111010001111100001110101101110001011000001110011011011000011110111101001000100001001111011010011100111101101011110 d3bfd3a3e1d6e2c1cdb0f7a4427b4e7dd3bfd3a3e1d6e2c1cdb0f7a4427b4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)