To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??nkf??nk^}Y??nkf??nk^}bE 00111111001111110110111001101011011001100011111100111111011011100110101101011110011111010101100100111111001111110110111001101011011001100011111100111111011011100110101101011110011111010110001001000101 3f3f6e6b663f3f6e6b5e7d593f3f6e6b663f3f6e6b5e7d6245
SJIS-WIN 歎尊nkf歎尊nk^}Y歎尊nkf歎尊nk^}bE 100100100101011010010001101110000110111001101011011001101001001001010110100100011011100001101110011010110101111001111101010110011001001001010110100100011011100001101110011010110110011010010010010101101001000110111000011011100110101101011110011111010110001001000101 925691b86e6b66925691b86e6b5e7d59925691b86e6b66925691b86e6b5e7d6245
EUC-JP 歎尊nkf歎尊nk^}Y歎尊nkf歎尊nk^}bE 110000111011011111000010101110100110111001101011011001101100001110110111110000101011101001101110011010110101111001111101010110011100001110110111110000101011101001101110011010110110011011000011101101111100001010111010011011100110101101011110011111010110001001000101 c3b7c2ba6e6b66c3b7c2ba6e6b5e7d59c3b7c2ba6e6b66c3b7c2ba6e6b5e7d6245
UTF-8 歎尊nkf歎尊nk^}Y歎尊nkf歎尊nk^}bE 1110011010101101100011101110010110110000100010100110111001101011011001101110011010101101100011101110010110110000100010100110111001101011010111100111110101011001111001101010110110001110111001011011000010001010011011100110101101100110111001101010110110001110111001011011000010001010011011100110101101011110011111010110001001000101 e6ad8ee5b08a6e6b66e6ad8ee5b08a6e6b5e7d59e6ad8ee5b08a6e6b66e6ad8ee5b08a6e6b5e7d6245
UHC 歎尊nkf歎尊nk^}Y歎尊nkf歎尊nk^}bE 111101111010011111110000111011100110111001101011011001101111011110100111111100001110111001101110011010110101111001111101010110011111011110100111111100001110111001101110011010110110011011110111101001111111000011101110011011100110101101011110011111010110001001000101 f7a7f0ee6e6b66f7a7f0ee6e6b5e7d59f7a7f0ee6e6b66f7a7f0ee6e6b5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)