What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 乳?d乳??哀糞n}乳?d乳??哀糞n{^ 10010011111110110011111110000010100001001001001111111011001111110011111110001000101000111001010110110011011011100111110110010011111110110011111110000010100001001001001111111011001111110011111110001000101000111001010110110011011011100111101101011110 93fb3f828493fb3f3f88a395b36e7d93fb3f828493fb3f3f88a395b36e7b5e
EUC-JP 乳?d乳??哀糞n}乳?d乳??哀糞n{^ 11000110111111010011111110100011111001001100011011111101001111110011111110110000101001011100101010110101011011100111110111000110111111010011111110100011111001001100011011111101001111110011111110110000101001011100101010110101011011100111101101011110 c6fd3fa3e4c6fd3f3fb0a5cab56e7dc6fd3fa3e4c6fd3f3fb0a5cab56e7b5e
UTF-8 乳㏘d乳㏘쨩哀糞n}乳㏘d乳㏘쨩哀糞n{^ 1110010010111001101100111110001110001111100110001110111110111101100001001110010010111001101100111110001110001111100110001110110010101000101010011110010110010011100000001110011110110011100111100110111001111101111001001011100110110011111000111000111110011000111011111011110110000100111001001011100110110011111000111000111110011000111011001010100010101001111001011001001110000000111001111011001110011110011011100111101101011110 e4b9b3e38f98efbd84e4b9b3e38f98eca8a9e59380e7b39e6e7de4b9b3e38f98efbd84e4b9b3e38f98eca8a9e59380e7b39e6e7b5e
UHC 乳㏘d乳㏘쨩哀糞n}乳㏘d乳㏘쨩哀糞n{^ 11101010111000011010001011100100101000111110010011101010111000011010001011100100110000101011101111100100111011101101110111010000011011100111110111101010111000011010001011100100101000111110010011101010111000011010001011100100110000101011101111100100111011101101110111010000011011100111101101011110 eae1a2e4a3e4eae1a2e4c2bbe4eeddd06e7deae1a2e4a3e4eae1a2e4c2bbe4eeddd06e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
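
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed to be Python's closest equivalents of the charsets listed, and the helper names `encode_row` and `convert` are hypothetical. Unencodable characters are replaced with "?", which matches the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset and return one row per charset."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("乳n}").items():
        print(name, binary, hexadecimal)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal one.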