To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN ???泣?????Lh???泣?????L 0011111100111111001111111000101110000011001111110011111100111111001111110011111101001100011010000011111100111111001111111000101110000011001111110011111100111111001111110011111101001100 3f3f3f8b833f3f3f3f3f4c683f3f3f8b833f3f3f3f3f4c
EUC-JP 濚??泣?????Lh濚??泣?????L 100011111100100110100001001111110011111110110101111000110011111100111111001111110011111100111111010011000110100010001111110010011010000100111111001111111011010111100011001111110011111100111111001111110011111101001100 8fc9a13f3fb5e33f3f3f3f3f4c688fc9a13f3fb5e33f3f3f3f3f4c
UTF-8 濚뱀슱泣먬펶琉꾩댅Lh濚뱀슱泣먬펶琉꾩댅L 111001101011111110011010111010111011000110000000111011001000101010110001111001101011001110100011111010111010100010101100111011011000111010110110111011111010011110001100111010101011111010101001111010111000110010000101010011000110100011100110101111111001101011101011101100011000000011101100100010101011000111100110101100111010001111101011101010001010110011101101100011101011011011101111101001111000110011101010101111101010100111101011100011001000010101001100 e6bf9aebb180ec8ab1e6b3a3eba8aced8eb6efa78ceabea9eb8c854c68e6bf9aebb180ec8ab1e6b3a3eba8aced8eb6efa78ceabea9eb8c854c
UHC 濚뱀슱泣먬펶琉꾩댅Lh濚뱀슱泣먬펶琉꾩댅L 111001111011100110111001111011001001101010111000111010111110100010010000111010011011110010000111111010111010010010000100111011001000100010101111010011000110100011100111101110011011100111101100100110101011100011101011111010001001000011101001101111001000011111101011101001001000010011101100100010001010111101001100 e7b9b9ec9ab8ebe890e9bc87eba484ec88af4c68e7b9b9ec9ab8ebe890e9bc87eba484ec88af4c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)