To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??揖??儀??壓???g?惟??鴉 1001100011011010001111110011111110010111010010110011111100111111100010110101011000111111001111111001101011011000001111110011111100111111100000101000011100111111100010001101001000111111001111111110100111101011 98da3f3f974b3f3f8b563f3f9ad83f3f3f82873f88d23f3fe9eb
EUC-JP 俑??揖??儀??壓??沅g?惟??鴉 11010000110111000011111100111111110011011010110000111111001111111011010110110111001111110011111111010100110110100011111100111111100011111100011011101001101000111110011100111111101100001101010000111111001111111111001011101101 d0dc3f3fcdac3f3fb5b73f3fd4da3f3f8fc6e9a3e73fb0d43f3ff2ed
UTF-8 俑앹늾揖닷렘儀먮솿壓믪궇沅g춯惟곕늅鴉 111001001011111110010001111011001001010110111001111010111000101010111110111001101000111110010110111010111000101110110111111010111010000010011000111001011000010010000000111010111010100010101110111011001000011010111111111001011010001110010011111010111010111110101010111010101011011010000111111001101011001010000101111011111011110110000111111011001011011010101111111001101000001110011111111010101011001110010101111010111000101010000101111010011011010010001001 e4bf91ec95b9eb8abee68f96eb8bb7eba098e58480eba8aeec86bfe5a393ebafaaeab687e6b285efbd87ecb6afe6839feab395eb8a85e9b489
UHC 俑앹늾揖닷렘儀먮솿壓믪궇沅g춯惟곕늅鴉 1110100110110101100111011110110010001000100001111110101111100111101101001110010110110111101111011110101111110000100100001110101110011001101100111110010011100010100100101110110010000010101000001110101010110110101000111110011110101101100011001110101011101110101100001110101110110100101111101110010010111100 e9b59dec8887ebe7b4e5b7bdebf090eb99b3e4e292ec82a0eab6a3e7ad8ceaeeb0ebb4bee4bc

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
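
The conversion shown in the table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper name `encode_to_bits` is hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "俑"  # first character of the table rows
    for label, codec in CHARSETS.items():
        bits, hexa = encode_to_bits(sample, codec)
        print(label, bits, hexa)
```

Running the script prints one line per charset for the sample character, which can be compared with the leading bytes of the corresponding table row.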