To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 筌k?爰??儀??Lh筌k?爰??儀??L 1110001010100011100000101000101100111111111000001010011100111111001111111000101101010110001111110011111101001100011010001110001010100011100000101000101100111111111000001010011100111111001111111000101101010110001111110011111101001100 e2a3828b3fe0a73f3f8b563f3f4c68e2a3828b3fe0a73f3f8b563f3f4c
EUC-JP 筌k?爰??儀??Lh筌k?爰??儀??L 1110010010100101101000111110101100111111111000001010100100111111001111111011010110110111001111110011111101001100011010001110010010100101101000111110101100111111111000001010100100111111001111111011010110110111001111110011111101001100 e4a5a3eb3fe0a93f3fb5b73f3f4c68e4a5a3eb3fe0a93f3fb5b73f3f4c
UTF-8 筌k똻爰쇽㏊儀쀬뒍Lh筌k똻爰쇽㏊儀쀬뒍L 111001111010110110001100111011111011110110001011111010111001100010111011111001111000100010110000111011001000011110111101111000111000111110001010111001011000010010000000111011001000000010101100111010111001001010001101010011000110100011100111101011011000110011101111101111011000101111101011100110001011101111100111100010001011000011101100100001111011110111100011100011111000101011100101100001001000000011101100100000001010110011101011100100101000110101001100 e7ad8cefbd8beb98bbe788b0ec87bde38f8ae58480ec80aceb928d4c68e7ad8cefbd8beb98bbe788b0ec87bde38f8ae58480ec80aceb928d4c
UHC 筌k똻爰쇽㏊儀쀬뒍Lh筌k똻爰쇽㏊儀쀬뒍L 111011111010011110100011111010111000110010000001111010101011101010111100111011111010011110110101111010111111000010010111111011001000101010001010010011000110100011101111101001111010001111101011100011001000000111101010101110101011110011101111101001111011010111101011111100001001011111101100100010101000101001001100 efa7a3eb8c81eababcefa7b5ebf097ec8a8a4c68efa7a3eb8c81eababcefa7b5ebf097ec8a8a4c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)