To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????h????????? 00111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f
SJIS-WIN 筌≪?揖??醫??h筌≪?揖??醫?? 111000101010001110000001111000010011111110010111010010110011111100111111111001111100111000111111001111110110100011100010101000111000000111100001001111111001011101001011001111110011111111100111110011100011111100111111 e2a381e13f974b3f3fe7ce3f3f68e2a381e13f974b3f3fe7ce3f3f
EUC-JP 筌≪?揖??醫??h筌≪?揖??醫?? 111001001010010110100010111000110011111111001101101011000011111100111111111011101101000000111111001111110110100011100100101001011010001011100011001111111100110110101100001111110011111111101110110100000011111100111111 e4a5a2e33fcdac3f3feed03f3f68e4a5a2e33fcdac3f3feed03f3f
UTF-8 筌≪눛揖깍㎗醫륁뒉h筌≪눛揖깍㎗醫륁뒉 11100111101011011000110011100010100010011010101011101011100010001001101111100110100011111001011011101010101110011000110111100011100011101001011111101001100001101010101111101011101001011000000111101011100100101000100101101000111001111010110110001100111000101000100110101010111010111000100010011011111001101000111110010110111010101011100110001101111000111000111010010111111010011000011010101011111010111010010110000001111010111001001010001001 e7ad8ce289aaeb889be68f96eab98de38e97e986abeba581eb928968e7ad8ce289aaeb889be68f96eab98de38e97e986abeba581eb9289
UHC 筌≪눛揖깍㎗醫륁뒉h筌≪눛揖깍㎗醫륁뒉 11101111101001111010000111101100100001111011001111101011111001111011000111101111101001111010001111101100101000101000111111101100100010101000011001101000111011111010011110100001111011001000011110110011111010111110011110110001111011111010011110100011111011001010001010001111111011001000101010000110 efa7a1ec87b3ebe7b1efa7a3eca28fec8a8668efa7a1ec87b3ebe7b1efa7a3eca28fec8a86

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)