To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???Tkf???Tk^}Y???Tkf???Tk^}bE 0011111100111111001111110101010001101011011001100011111100111111001111110101010001101011010111100111110101011001001111110011111100111111010101000110101101100110001111110011111100111111010101000110101101011110011111010110001001000101 3f3f3f546b663f3f3f546b5e7d593f3f3f546b663f3f3f546b5e7d6245
SJIS-WIN 杓爾貉Tkf杓爾貉Tk^}Y杓爾貉Tkf杓爾貉Tk^}bE 1000111011011011100011101010001011100110101110010101010001101011011001101000111011011011100011101010001011100110101110010101010001101011010111100111110101011001100011101101101110001110101000101110011010111001010101000110101101100110100011101101101110001110101000101110011010111001010101000110101101011110011111010110001001000101 8edb8ea2e6b9546b668edb8ea2e6b9546b5e7d598edb8ea2e6b9546b668edb8ea2e6b9546b5e7d6245
EUC-JP 杓爾貉Tkf杓爾貉Tk^}Y杓爾貉Tkf杓爾貉Tk^}bE 1011110011011101101111001010010011101100101110110101010001101011011001101011110011011101101111001010010011101100101110110101010001101011010111100111110101011001101111001101110110111100101001001110110010111011010101000110101101100110101111001101110110111100101001001110110010111011010101000110101101011110011111010110001001000101 bcddbca4ecbb546b66bcddbca4ecbb546b5e7d59bcddbca4ecbb546b66bcddbca4ecbb546b5e7d6245
UTF-8 杓爾貉Tkf杓爾貉Tk^}Y杓爾貉Tkf杓爾貉Tk^}bE 1110011010011101100100111110011110001000101111101110100010110010100010010101010001101011011001101110011010011101100100111110011110001000101111101110100010110010100010010101010001101011010111100111110101011001111001101001110110010011111001111000100010111110111010001011001010001001010101000110101101100110111001101001110110010011111001111000100010111110111010001011001010001001010101000110101101011110011111010110001001000101 e69d93e788bee8b289546b66e69d93e788bee8b289546b5e7d59e69d93e788bee8b289546b66e69d93e788bee8b289546b5e7d6245
UHC 杓爾?Tkf杓爾?Tk^}Y杓爾?Tkf杓爾?Tk^}bE 11111000111101011110110010110011001111110101010001101011011001101111100011110101111011001011001100111111010101000110101101011110011111010101100111111000111101011110110010110011001111110101010001101011011001101111100011110101111011001011001100111111010101000110101101011110011111010110001001000101 f8f5ecb33f546b66f8f5ecb33f546b5e7d59f8f5ecb33f546b66f8f5ecb33f546b5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)