Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 哀???筌??誼▼?猷???レ?誼??邑??B 10001000101000110011111100111111001111111110001010100011001111110011111110001011011000101000000110100101001111111001011101010001001111110011111100111111100000111000110000111111100010110110001000111111001111111001011101010111001111110011111101000010 88a33f3f3fe2a33f3f8b6281a53f97513f3f3f838c3f8b623f3f97573f3f42
EUC-JP 哀???筌??誼▼?猷???レ?誼??邑??B 10110000101001010011111100111111001111111110010010100101001111110011111110110101110000111010001010100111001111111100110110110010001111110011111100111111101001011110110000111111101101011100001100111111001111111100110110111000001111110011111101000010 b0a53f3f3fe4a53f3fb5c3a2a73fcdb23f3f3fa5ec3fb5c33f3fcdb83f3f42
UTF-8 哀읪딄퐥筌뗪퉭誼▼맅猷밴틕曆レ쉶誼울쭓邑녠틧B 11100101100100111000000011101100100111011010101011101011100101001000010011101101100100001010010111100111101011011000110011101011100101111010101011101101100010011010110111101000101010101011110011100010100101101011110011101011101001111000010111100111100011001011011111101011101100001011010011101101100010111001010111101111101001101000101111100011100000111010110011101100100010011011011011101000101010101011110011101100100110101011100011101100101011011001001111101001100000101001000111101011100001011010000011101101100010111010011101000010 e59380ec9daaeb9484ed90a5e7ad8ceb97aaed89ade8aabce296bceba785e78cb7ebb0b4ed8b95efa68be383acec89b6e8aabcec9ab8ecad93e98291eb85a0ed8ba742
UHC 哀읪딄퐥筌뗪퉭誼▼맅猷밴틕曆レ쉶誼울쭓邑녠틧B 111001001110111010011111110100011000101011101010101111011000111011101111101001111000101111101010101110011000010111101011111111101010000111100101100100001001111111101011101000111011100111101010101110101000001111100110101101111010101111101100100110101000110011101011111111101011111111101111101001111000101111101011111010011011001111101010101110101001000101000010 e4ee9fd18aeabd8eefa78beab985ebfea1e5909feba3b9eaba83e6b7abec9a8cebfebfefa78bebe9b3eaba9142

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
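
The conversion shown in the table can be approximated offline with a few lines of Python. This is only a sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the names `CHARSETS`, `encode_row`, and `convert` are hypothetical. Unencodable characters are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec lacks.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("哀B").items():
        print(label, bits, hex_string)
```

For the single character 哀, this sketch gives 88a3 under `cp932`, b0a5 under `euc_jp`, and e59380 under `utf-8`, which agrees with the leading bytes of the corresponding rows in the table.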