To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 迢ク豺。謐臥矯譌丞ュ倡矯豺。謐臥矯譌丞ュ錬 11100111100010111011100011100110101101111010000111100110100011011000100111100111100010111011100011100110100101111000111111100101101011011001100011100111100010111011100011100110101101111010000111100110100011011000100111100111100010111011100011100110100101111000111111100101101011011001100001000010 e78bb8e6b7a1e68d89e78bb8e6978fe5ad98e78bb8e6b7a1e68d89e78bb8e6978fe5ad9842
EUC-JP 迢ク豺。謐臥矯譌丞ュ倡矯豺。謐臥矯譌丞ュ錬 111011011110101110001110101110001110110010111001100011101010000111101011111011011011001011101001101101101011101011101011111101111011111011100111100011101010110111010000111010011011011010111010111011001011100110001110101000011110101111101101101100101110100110110110101110101110101111110111101111101110011110001110101011011100111110100011 edeb8eb8ecb98ea1ebedb2e9b6baebf7bee78eadd0e9b6baecb98ea1ebedb2e9b6baebf7bee78eadcfa3
UTF-8 迢ク豺。謐臥矯譌丞ュ倡矯豺。謐臥矯譌丞ュ錬 111010001011111110100010111011111011110110111000111010001011000110111010111011111011110110100001111010001010110010010000111010001000011110100101111001111001111110101111111010001010110110001100111001001011100010011110111011111011110110101101111001011000000010100001111001111001111110101111111010001011000110111010111011111011110110100001111010001010110010010000111010001000011110100101111001111001111110101111111010001010110110001100111001001011100010011110111011111011110110101101111010011000110010101100 e8bfa2efbdb8e8b1baefbda1e8ac90e887a5e79fafe8ad8ce4b89eefbdade580a1e79fafe8b1baefbda1e8ac90e887a5e79fafe8ad8ce4b89eefbdade98cac
UHC ??豺?謐臥矯?丞?倡矯豺?謐臥矯?丞?? 001111110011111111100011110011110011111111011010110011011110100011000010110011101110110000111111111000111010101000111111111100111101101111001110111011001110001111001111001111111101101011001101111010001100001011001110111011000011111111100011101010100011111100111111 3f3fe3cf3fdacde8c2ceec3fe3aa3ff3dbceece3cf3fdacde8c2ceec3fe3aa3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)