To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^b[?????????^b[^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110011000100101101100111111001111110011111100111111001111110011111100111111001111110011111101011110011000100101101101011110 3f3f3f3f3f3f3f3f3f5e625b3f3f3f3f3f3f3f3f3f5e625b5e
SJIS-WIN 永??泣???⑥?^b[永??泣???⑥?^b[^ 10001001011010010011111100111111100010111000001100111111001111110011111110000111010001010011111101011110011000100101101110001001011010010011111100111111100010111000001100111111001111110011111110000111010001010011111101011110011000100101101101011110 89693f3f8b833f3f3f87453f5e625b89693f3f8b833f3f3f87453f5e625b5e
EUC-JP 永??泣??洹??^b[永??泣??洹??^b[^ 101100011100101000111111001111111011010111100011001111110011111110001111110001111011101000111111001111110101111001100010010110111011000111001010001111110011111110110101111000110011111100111111100011111100011110111010001111110011111101011110011000100101101101011110 b1ca3f3fb5e33f3f8fc7ba3f3f5e625bb1ca3f3fb5e33f3f8fc7ba3f3f5e625b5e
UTF-8 永띔퍜泣섊독洹⑥돩^b[永띔퍜泣섊독洹⑥돩^b[^ 11100110101100001011100011101011100111011001010011101101100011011001110011100110101100111010001111101100100001001000101011101011100011111000010111100110101101001011100111100010100100011010010111101011100011111010100101011110011000100101101111100110101100001011100011101011100111011001010011101101100011011001110011100110101100111010001111101100100001001000101011101011100011111000010111100110101101001011100111100010100100011010010111101011100011111010100101011110011000100101101101011110 e6b0b8eb9d94ed8d9ce6b3a3ec848aeb8f85e6b4b9e291a5eb8fa95e625be6b0b8eb9d94ed8d9ce6b3a3ec848aeb8f85e6b4b9e291a5eb8fa95e625b5e
UHC 永띔퍜泣섊독洹⑥돩^b[永띔퍜泣섊독洹⑥돩^b[^ 11100111101101011011011011101010101110111001001111101011111010001001100011100111101101011011011011101010101101111010100011101100100010011010110001011110011000100101101111100111101101011011011011101010101110111001001111101011111010001001100011100111101101011011011011101010101101111010100011101100100010011010110001011110011000100101101101011110 e7b5b6eabb93ebe898e7b5b6eab7a8ec89ac5e625be7b5b6eabb93ebe898e7b5b6eab7a8ec89ac5e625b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)