To what bit string is a character (or characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?泣??淫??嚴щ?愉??矣??? 111001001110100010000010111010100011111110001011100000110011111100111111100010001111101000111111001111111001101010001110100001001000101100111111100101101111100100111111001111111110000111100001001111110011111100111111 e4e882ea3f8b833f3f88fa3f3f9a8e848b3f96f93f3fe1e13f3f3f
EUC-JP 蒻れ?泣??淫??嚴щ?愉??矣??孼 1110100011101010101001001110110000111111101101011110001100111111001111111011000011111100001111110011111111010011111011101010011111101011001111111100110011111011001111110011111111100010111000110011111100111111100011111011101011000011 e8eaa4ec3fb5e33f3fb0fc3f3fd3eea7eb3fccfb3f3fe2e33f3f8fbac3
UTF-8 蒻れ슦泣길룚淫됱숱嚴щ벊愉녑땔矣곗뒻孼 1110100010010010101110111110001110000010100011001110110010001010101001101110011010110011101000111110101010111000101110001110101110100011100110101110011010110111101010111110101110010000101100011110110010001000101100011110010110011010101101001101000110001001111010111011001010001010111001101000010010001001111010111000010110010001111010111001010110010100111001111001111110100011111010101011001110010111111010111001001010111011111001011010110110111100 e892bbe3828cec8aa6e6b3a3eab8b8eba39ae6b7abeb90b1ec88b1e59ab4d189ebb28ae68489eb8591eb9594e79fa3eab397eb92bbe5adbc
UHC 蒻れ슦泣길룚淫됱숱嚴щ벊愉녑땔矣곗뒻孼 1110010110110110101010101110110010011010101100001110101111101000101100011110011010001111100101101110101111100010100010011110110010111101101000101110010111110001101011001110101110010011101011011110101011110000101100111110010110110110101010101110101111111000101100001110110010001010101100011110010111101101 e5b6aaec9ab0ebe8b1e68f96ebe289ecbda2e5f1aceb93adeaf0b3e5b6aaebf8b0ec8ab1e5ed

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
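
The same kind of conversion can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_report` is hypothetical, and the codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (0x3f), which is how the `3f` bytes in the table arise.

```python
def encode_report(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


# Example input chosen for illustration: HIRAGANA LETTER A (U+3042)
for name, bits, hex_string in encode_report("あ"):
    print(name, bits, hex_string)
```

For the input `あ`, this yields `3f` for ISO-8859-1 (the character is not representable), `82a0` for CP932, `a4a2` for EUC-JP, and `e38182` for UTF-8.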