To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 偲示偲雫篠疾偲式篠ｶﾄ治偲痔偲鹿篠竺B 1000111011000011100011101010011010001110110000111000111010110100100011101100001010001110101111101000111011000011100011101010111010001110110000101011011011000100100011101010000110001110110000111000111010100100100011101100001110001110101011011000111011000010100011101011000101000010 8ec38ea68ec38eb48ec28ebe8ec38eae8ec2b6c48ea18ec38ea48ec38ead8ec28eb142
EUC-JP 偲示偲雫篠疾偲式篠ｶﾄ治偲痔偲鹿篠竺B 10111100110001011011110010101000101111001100010110111100101101101011110011000100101111001100000010111100110001011011110010110000101111001100010010001110101101101000111011000100101111001010001110111100110001011011110010100110101111001100010110111100101011111011110011000100101111001011001101000010 bcc5bca8bcc5bcb6bcc4bcc0bcc5bcb0bcc48eb68ec4bca3bcc5bca6bcc5bcafbcc4bcb342
UTF-8 偲示偲雫篠疾偲式篠ｶﾄ治偲痔偲鹿篠竺B 11100101100000011011001011100111101001001011101011100101100000011011001011101001100110111010101111100111101011111010000011100111100101101011111011100101100000011011001011100101101111001000111111100111101011111010000011101111101111011011011011101111101111101000010011100110101100101011101111100101100000011011001011100111100101111001010011100101100000011011001011101001101110011011111111100111101011111010000011100111101010111011101001000010 e581b2e7a4bae581b2e99babe7afa0e796bee581b2e5bc8fe7afa0efbdb6efbe84e6b2bbe581b2e79794e581b2e9b9bfe7afa0e7abba42
UHC ?示??篠疾?式篠??治?痔?鹿篠竺B 0011111111100011110001100011111100111111111000011100011011110010111100000011111111100011110100101110000111000110001111110011111111110110101111010011111111110110110000000011111111010110111000111110000111000110111101011110011101000010 3fe3c63f3fe1c6f2f03fe3d2e1c63f3ff6bd3ff6c03fd6e3e1c6f5e742

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
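
A table like the one above can be approximated offline with a short Python sketch. This is not the code behind this page: the mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_row and convert. Characters a charset cannot represent are replaced with "?" (byte 3f), as in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # Unencodable characters become "?" instead of raising UnicodeEncodeError.
    raw = text.encode(codec, errors="replace")
    # Decode again so the "Character" column shows what actually survived.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode the same text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexed) in convert("あB").items():
        print(label, shown, bits, hexed)
```

Byte values for characters outside a charset's core repertoire can differ slightly between implementations, so results for unusual characters may not match this page byte for byte.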