What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????{N}????????{N{^ 0011111100111111001111110011111100111111001111110011111100111111011110110100111001111101001111110011111100111111001111110011111100111111001111110011111101111011010011100111101101011110 3f3f3f3f3f3f3f3f7b4e7d3f3f3f3f3f3f3f3f7b4e7b5e
SJIS-WIN 偲辞偲磁篠シト式{N}偲辞偲磁篠シト式{N{^ 1000111011000011100011101010101110001110110000111000111010100101100011101100001010111100110001001000111010101110011110110100111001111101100011101100001110001110101010111000111011000011100011101010010110001110110000101011110011000100100011101010111001111011010011100111101101011110 8ec38eab8ec38ea58ec2bcc48eae7b4e7d8ec38eab8ec38ea58ec2bcc48eae7b4e7b5e
EUC-JP 偲辞偲磁篠シト式{N}偲辞偲磁篠シト式{N{^ 101111001100010110111100101011011011110011000101101111001010011110111100110001001000111010111100100011101100010010111100101100000111101101001110011111011011110011000101101111001010110110111100110001011011110010100111101111001100010010001110101111001000111011000100101111001011000001111011010011100111101101011110 bcc5bcadbcc5bca7bcc48ebc8ec4bcb07b4e7dbcc5bcadbcc5bca7bcc48ebc8ec4bcb07b4e7b5e
UTF-8 偲辞偲磁篠シト式{N}偲辞偲磁篠シト式{N{^ 11100101100000011011001011101000101111101001111011100101100000011011001011100111101000111000000111100111101011111010000011101111101111011011110011101111101111101000010011100101101111001000111101111011010011100111110111100101100000011011001011101000101111101001111011100101100000011011001011100111101000111000000111100111101011111010000011101111101111011011110011101111101111101000010011100101101111001000111101111011010011100111101101011110 e581b2e8be9ee581b2e7a381e7afa0efbdbcefbe84e5bc8f7b4e7de581b2e8be9ee581b2e7a381e7afa0efbdbcefbe84e5bc8f7b4e7b5e
UHC ???磁篠??式{N}???磁篠??式{N{^ 0011111100111111001111111110110110111000111000011100011000111111001111111110001111010010011110110100111001111101001111110011111100111111111011011011100011100001110001100011111100111111111000111101001001111011010011100111101101011110 3f3f3fedb8e1c63f3fe3d27b4e7d3f3f3fedb8e1c63f3fe3d27b4e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
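
The table above can be approximated offline with a minimal Python sketch. It is not the converter's own implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches what the table shows for ISO-8859-1 and UHC.

```python
# Table label -> Python codec name. These pairings are assumptions,
# not something the converter page documents.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("式{N}").items():
        print(label, bits, hex_string)
```

Running the script prints one line per charset for the sample input `式{N}`. The ASCII part `{N}` encodes to `7b4e7d` in every charset listed, while `式` takes two bytes in SJIS-WIN and EUC-JP and three in UTF-8.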