To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌?????醫??冶⑤??э?袁??筌 1110001010100011001111110011111100111111001111110011111111100111110011100011111100111111100101101110100010000111010001000011111100111111100001001000111100111111111001011100110100111111001111111110001010100011 e2a33f3f3f3f3fe7ce3f3f96e887443f3f848f3fe5cd3f3fe2a3
EUC-JP 筌™????醫??冶?Ŋ彛э?袁??筌 11100100101001011000111110100010111011110011111100111111001111110011111111101110110100000011111100111111110011001110101000111111100011111010100110101011100011111011110011111010101001111110111100111111111010101100111100111111001111111110010010100101 e4a58fa2ef3f3f3f3feed03f3fccea3f8fa9ab8fbcfaa7ef3feacf3f3fe4a5
UTF-8 筌™뫁李볩㏊醫롮퐧冶⑤Ŋ彛э㎖袁⑷갭筌 11100111101011011000110011100010100001001010001011101011101010111000000111101111101001111010000111101011101100111010100111100011100011111000101011101001100001101010101111101011101000011010111011101101100100001010011111100101100001101011011011100010100100011010010011000101100010101110010110111101100110111101000110001101111000111000111010010110111010001010001010000001111000101001000110110111111010101011000010101101111001111010110110001100 e7ad8ce284a2ebab81efa7a1ebb3a9e38f8ae986abeba1aeed90a7e586b6e291a4c58ae5bd9bd18de38e96e8a281e291b7eab0ade7ad8c
UHC 筌™뫁李볩㏊醫롮퐧冶⑤Ŋ彛э㎖袁⑷갭筌 1110111110100111101000101110001010010001101001011110110010110000100100111110111110100111101101011110110010100010100011101110110010111101100100001110010110100111101010001110101110101000101011111110110010101101101011001110111110100111101000101110101010111110101010011110101010110000101110001110111110100111 efa7a2e291a5ecb093efa7b5eca28eecbd90e5a7a8eba8afecadacefa7a2eabea9eab0b8efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)