What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 橈??油??矣??巍リ?援??純??娃 100111101111010000111111001111111001011011111011001111110011111111100001111000010011111100111111100110111101100110000011100010100011111110001001100001110011111100111111100011111000001100111111001111111000100010100001 9ef43f3f96fb3f3fe1e13f3f9bd9838a3f89873f3f8f833f3f88a1
EUC-JP 橈??油??矣??巍リ?援??純??娃 110111001111011000111111001111111100110011111101001111110011111111100010111000110011111100111111110101101101101110100101111010100011111110110001111001110011111100111111101111011110001100111111001111111011000010100011 dcf63f3fccfd3f3fe2e33f3fd6dba5ea3fb1e73f3fbde33f3fb0a3
UTF-8 橈볥굝油꾣꼷矣꾨쨨巍リ랩援졾윜純볤컜娃 111001101010100110001000111010111011001110100101111010101011010110011101111001101011001010111001111010101011111010100011111010101011110010110111111001111001111110100011111010101011111010101000111011001010100010101000111001011011011110001101111000111000001110101010111010111001111010101001111001101000111110110100111011001010000110111110111011001001110010011100111001111011010010010100111010111011001110100100111011001011101110011100111001011010100010000011 e6a988ebb3a5eab59de6b2b9eabea3eabcb7e79fa3eabea8eca8a8e5b78de383aaeb9ea9e68fb4eca1beec9c9ce7b494ebb3a4ecbb9ce5a883
UHC 橈볥굝油꾣꼷矣꾨쨨巍リ랩援졾윜純볤컜娃 1110100011111010100100111110101110000010100001011110101011111010100001001110011010000100100011111110101111111000100001001110101110100100100000111110100011100100101010111110101010110111101001101110101010110101101000001110010110011111100111111110001011101101100100111110101010110000100001111110100011011111 e8fa93eb8285eafa84e6848febf884eba483e8e4abeab7a6eab5a0e59f9fe2ed93eab087e8df

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
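
The conversion shown in the table can be approximated offline with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function names (encode_to_bits, convert) are hypothetical, and the mapping from the charset labels to Python codec names (SJIS-WIN as cp932, UHC as cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the 3f bytes in the table arise.

```python
# Charset labels from the table mapped to Python codec names.
# Assumption: SJIS-WIN corresponds to cp932 and UHC to cp949.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_to_bits(text, codec)
            for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal column.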