To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???泣??純????????泣??純?????B 0011111100111111001111111000101110000011001111110011111110001111100000110011111100111111001111110011111100111111001111110011111100111111100010111000001100111111001111111000111110000011001111110011111100111111001111110011111101000010 3f3f3f8b833f3f8f833f3f3f3f3f3f3f3f8b833f3f8f833f3f3f3f3f42
EUC-JP ???泣??純????????泣??純?????B 0011111100111111001111111011010111100011001111110011111110111101111000110011111100111111001111110011111100111111001111110011111100111111101101011110001100111111001111111011110111100011001111110011111100111111001111110011111101000010 3f3f3fb5e33f3fbde33f3f3f3f3f3f3f3fb5e33f3fbde33f3f3f3f3f42
UTF-8 閱륁슱泣덅쫨純섑뜟咽됰씀閱륁슱泣덅쫨純섑뜟咽됰씀B 11101001100101101011000111101011101001011000000111101100100010101011000111100110101100111010001111101011100011011000010111101100101010111010100011100111101101001001010011101100100001001001000111101011100111001001111111101111101001101001111011101011100100001011000011101100100101001000000011101001100101101011000111101011101001011000000111101100100010101011000111100110101100111010001111101011100011011000010111101100101010111010100011100111101101001001010011101100100001001001000111101011100111001001111111101111101001101001111011101011100100001011000011101100100101001000000001000010 e996b1eba581ec8ab1e6b3a3eb8d85ecaba8e7b494ec8491eb9c9fefa69eeb90b0ec9480e996b1eba581ec8ab1e6b3a3eb8d85ecaba8e7b494ec8491eb9c9fefa69eeb90b0ec948042
UHC 閱륁슱泣덅쫨純섑뜟咽됰씀閱륁슱泣덅쫨純섑뜟咽됰씀B 11100110111100111000111111101100100110101011100011101011111010001000100011101000101001101000000111100010111011011001100011101101100011011010001011100110111011001000100111101011101111101011100011100110111100111000111111101100100110101011100011101011111010001000100011101000101001101000000111100010111011011001100011101101100011011010001011100110111011001000100111101011101111101011100001000010 e6f38fec9ab8ebe888e8a681e2ed98ed8da2e6ec89ebbeb8e6f38fec9ab8ebe888e8a681e2ed98ed8da2e6ec89ebbeb842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)