To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??M??????????M???????? 00111111001111110100110100111111001111110011111100111111001111110011111100111111001111110011111100111111010011010011111100111111001111110011111100111111001111110011111100111111 3f3f4d3f3f3f3f3f3f3f3f3f3f4d3f3f3f3f3f3f3f3f
SJIS-WIN 偲悉M篠耳偲悉篠ワト式偲悉M篠耳偲而篠゙ト璽 1000111011000011100011101011101101001101100011101100001010001110101010001000111011000011100011101011101110001110110000101101110011000100100011101010111010001110110000111000111010111011010011011000111011000010100011101010100010001110110000111000111010100111100011101100001011011110110001001000111010100011 8ec38ebb4d8ec28ea88ec38ebb8ec2dcc48eae8ec38ebb4d8ec28ea88ec38ea78ec2dec48ea3
EUC-JP 偲悉M篠耳偲悉篠ワト式偲悉M篠耳偲而篠゙ト璽 101111001100010110111100101111010100110110111100110001001011110010101010101111001100010110111100101111011011110011000100100011101101110010001110110001001011110010110000101111001100010110111100101111010100110110111100110001001011110010101010101111001100010110111100101010011011110011000100100011101101111010001110110001001011110010100101 bcc5bcbd4dbcc4bcaabcc5bcbdbcc48edc8ec4bcb0bcc5bcbd4dbcc4bcaabcc5bca9bcc48ede8ec4bca5
UTF-8 偲悉M篠耳偲悉篠ワト式偲悉M篠耳偲而篠゙ト璽 1110010110000001101100101110011010000010100010010100110111100111101011111010000011101000100000001011001111100101100000011011001011100110100000101000100111100111101011111010000011101111101111101001110011101111101111101000010011100101101111001000111111100101100000011011001011100110100000101000100101001101111001111010111110100000111010001000000010110011111001011000000110110010111010001000000010001100111001111010111110100000111011111011111010011110111011111011111010000100111001111001001010111101 e581b2e682894de7afa0e880b3e581b2e68289e7afa0efbe9cefbe84e5bc8fe581b2e682894de7afa0e880b3e581b2e8808ce7afa0efbe9eefbe84e792bd
UHC ?悉M篠耳?悉篠??式?悉M篠耳?而篠??璽 00111111111000111111101001001101111000011100011011101100101111000011111111100011111110101110000111000110001111110011111111100011110100100011111111100011111110100100110111100001110001101110110010111100001111111110110010111011111000011100011000111111001111111101111111011110 3fe3fa4de1c6ecbc3fe3fae1c63f3fe3d23fe3fa4de1c6ecbc3fecbbe1c63f3fdfde

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)