What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??飮??矣??永?〕泣??源??驛 100101101110100100111111001111111001111101011010001111110011111111100001111000010011111100111111100010010110100100111111100000010110110010001011100000110011111100111111100011001011100100111111001111111110100110000011 96e93f3f9f5a3f3fe1e13f3f89693f816c8b833f3f8cb93f3fe983
EUC-JP 夜??飮??矣??永?〕泣??源??驛 110011001110101100111111001111111101110110111011001111110011111111100010111000110011111100111111101100011100101000111111101000011100110110110101111000110011111100111111101110001011101100111111001111111111000111100011 cceb3f3fddbb3f3fe2e33f3fb1ca3fa1cdb5e33f3fb8bb3f3ff1e3
UTF-8 夜껊씛飮꿩퓴矣묒뒟永띕〕泣됪썫源띿뒯驛 111001011010010010011100111010101011101110001010111011001001010010011011111010011010001110101110111010101011111110101001111011011001001110110100111001111001111110100011111010111010110010010010111010111001001010011111111001101011000010111000111010111001110110010101111000111000000010010101111001101011001110100011111010111001000010101010111011001000110110101011111001101011101010010000111010111001110110111111111010111001001010101111111010011010100110011011 e5a49ceabb8aec949be9a3aeeabfa9ed93b4e79fa3ebac92eb929fe6b0b8eb9d95e38095e6b3a3eb90aaec8dabe6ba90eb9dbfeb92afe9a99b
UHC 夜껊씛飮꿩퓴矣묒뒟永띕〕泣됪썫源띿뒯驛 1110010110101000100000111110101110011101101100001110101111100110101100101110011010111111100110101110101111111000100100011110110010001010100110111110011110110101101101101110101110100001101100111110101111101000100010011110011010011011100111001110101010111001100011011110110010001010101010001110011010111110 e5a883eb9db0ebe6b2e6bf9aebf891ec8a9be7b5b6eba1b3ebe889e69b9ceab98dec8aa8e6be
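
The conversion shown in the table above can be approximated with a short script. This is only a sketch, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The original tool may use a different library with slightly different tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("夜").items():
        print(label, binary, hexadecimal)
```

For the single character 夜, this yields the same leading bytes as the table rows: `96e9` for SJIS-WIN, `cceb` for EUC-JP, and `e5a49c` for UTF-8.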

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).