To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN テδ陛δ甘δ禿つキテδ凝δ古δ禿つキB 110000111000001111000010100101011100001110000011110000101000101011000011100000111100001010010011110000111000001011000010101101111100001110000011110000101000101111000011100000111100001010001100110000111000001111000010100100111100001110000010110000101011011101000010 c383c295c383c28ac383c293c382c2b7c383c28bc383c28cc383c293c382c2b742
EUC-JP テδ陛δ甘δ禿つキテδ凝δ古δ禿つキB 10001110110000111010011011000100110010101100010110100110110001001011010011000101101001101100010011000110110001011010010011000100100011101011011110001110110000111010011011000100101101101100010110100110110001001011100011000101101001101100010011000110110001011010010011000100100011101011011101000010 8ec3a6c4cac5a6c4b4c5a6c4c6c5a4c48eb78ec3a6c4b6c5a6c4b8c5a6c4c6c5a4c48eb742
UTF-8 テδ陛δ甘δ禿つキテδ凝δ古δ禿つキB 11101111101111101000001111001110101101001110100110011001100110111100111010110100111001111001010010011000110011101011010011100111101001101011111111100011100000011010010011101111101111011011011111101111101111101000001111001110101101001110010110000111100111011100111010110100111001011000111110100100110011101011010011100111101001101011111111100011100000011010010011101111101111011011011101000010 efbe83ceb4e9999bceb4e79498ceb4e7a6bfe381a4efbdb7efbe83ceb4e5879dceb4e58fa4ceb4e7a6bfe381a4efbdb742
UHC ?δ陛δ甘δ禿つ??δ凝δ古δ禿つ?B 001111111010010111100100111110001100111010100101111001001100101011110110101001011110010011010100101111101010101011000100001111110011111110100101111001001110101111101010101001011110010011001101101011111010010111100100110101001011111010101010110001000011111101000010 3fa5e4f8cea5e4caf6a5e4d4beaac43f3fa5e4ebeaa5e4cdafa5e4d4beaac43f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)