To which bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 夜??揖??袁ъ?z夜??揖??袁ъ?zB 1001011011101001001111110011111110010111010010110011111100111111111001011100110110000100100011000011111101111010100101101110100100111111001111111001011101001011001111110011111111100101110011011000010010001100001111110111101001000010 96e93f3f974b3f3fe5cd848c3f7a96e93f3f974b3f3fe5cd848c3f7a42
EUC-JP 夜??揖??袁ъ?z夜??揖??袁ъ?zB 1100110011101011001111110011111111001101101011000011111100111111111010101100111110100111111011000011111101111010110011001110101100111111001111111100110110101100001111110011111111101010110011111010011111101100001111110111101001000010 cceb3f3fcdac3f3feacfa7ec3f7acceb3f3fcdac3f3feacfa7ec3f7a42
UTF-8 夜껊씛揖졿콢袁ъ돟z夜껊씛揖졿콢袁ъ돟zB 11100101101001001001110011101010101110111000101011101100100101001001101111100110100011111001011011101100101000011011111111101100101111011010001011101000101000101000000111010001100010101110101110001111100111110111101011100101101001001001110011101010101110111000101011101100100101001001101111100110100011111001011011101100101000011011111111101100101111011010001011101000101000101000000111010001100010101110101110001111100111110111101001000010 e5a49ceabb8aec949be68f96eca1bfecbda2e8a281d18aeb8f9f7ae5a49ceabb8aec949be68f96eca1bfecbda2e8a281d18aeb8f9f7a42
UHC 夜껊씛揖졿콢袁ъ돟z夜껊씛揖졿콢袁ъ돟zB 111001011010100010000011111010111001110110110000111010111110011110100000111001101011000110011010111010101011111010101100111011001000100110100101011110101110010110101000100000111110101110011101101100001110101111100111101000001110011010110001100110101110101010111110101011001110110010001001101001010111101001000010 e5a883eb9db0ebe7a0e6b19aeabeacec89a57ae5a883eb9db0ebe7a0e6b19aeabeacec89a57a42

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
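
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are Python's and their mapping to the labels above is an assumption, as is the helper name `encode_in_charsets`. Characters that a charset cannot represent appear as `?` (0x3f) in the table, which `errors="replace"` mimics here.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {label: (binary bit string, hexadecimal string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # Unencodable characters are replaced with "?" (byte 0x3f).
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("夜z").items():
        print(label, bits, hex_string)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.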